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00 (54) Title: BSTP-RAS/RERG PROTEIN AND RELATED REAGENTS AND METHODS OF USE THEREOF 
00 

^ (57) Abstract: The invention provides a protein (BSTP-Ras/Rerg) and polynucleotides which identify and encode BSTP-Ras. The 
invention also provides expression vectors, host cells, antibodies, agonists, and antagonists. In addition, the invention provides 

^2 methods for treating or preventing disorders of cell proliferation, particularly breast cancer, by administering a pharmaceutical com- 
position comprising a polypeptide, polynucleotide, or antibody of the invention. The invention also provides methods of classifying 

^ diseases, particularly breast cancer, by detecting expression of BSTP-Ras or of a polynucleotide encoding BSTP-Ras. and of pro- 

^ viding diagnostic, prognostic, and/or predictive information for a patient based on the detection and/or measurement of BSTP-Ras 

^ or of a gene that encodes BSTP-Ras or of a mutation in the BST-RAS/Rerg gene. 
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BSTP-RAS/Et£RG PROTEIN AND RELATED REAGENTS AND METHODS 

OF USE THEREOF 

GOVERNMENT SUPPORT 
5 The U.S. Government has a paid-up license in this invention and the rig^t m 

limited circumstances to require the patent owner to license others on reasonable 
terms as provided for by the terms of Grant Nos. NIH CA77097, CA69577, CA41966, 
CA41996. and CA42978 awarded by Hae National Institute of Health. 

10 CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims priority to provisional applications U.S.S 
60/220,967, filed July 26, 2000, and U.S.SX 60/237,819 filed October 3, 2000, both 
of which are incorporated herein by reference* 

15 BACKGROUND OF THE INVENTION 

A major chall^ge of cancer treatment is to target specific therapies to distinct 
tumor types in order to maximize efGcacy and minimize toxicity. A related challenge 
lies in the attempt to provide accurate diagnostic, prognostic, and predictive 
information. At present, breast tumors are described with the tumor-node-metastasis 

20 (TNM) system. This system, which uses the size of the tumor, the presence or 
absence of tumor in regional lymph nodes, and the presence or absence of distant 
metastases, to assign a stege to the tumor is described in the American Joint 
Committee on Cancer: AJCC Cancer Staging Manual. Philadelphia, Pa: Lippincott- 
Raven Publishers, Sth ed., 1997, pp 171-180, and further discussion is found in Harris, 

25 JR: "Stagmg of breast carcinoma" in Harris, J.R., Hellman, S., Henderson, I.C., Kinne 
D.W. (eds.): Breast Diseases. Philadelphia, Lippincott, 1991. The assigned stage is 
used as a basis for selection of appropriate therapy and for prognostic purposes. In 
addition to the TNM parameters, morphologic appearance is used to further classify 
tumors and thereby aid in selection of appropriate therapy. However, this approach 

30 has serious limitations.v Tumors with similar histopathologic appearance can exhibit 
significant variability in terms of clinical course and response to therapy. For 
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example, some tumors are rapidly progressive while others are not Some tumors 
respond readily to hormonal therapy or chemotherapy while others are resistant 
Assays for cell surfece markers, e.g., using immunohistochemistry, have 
provided means for dividing certain tumor types into subclasses. For example, one 
S factor considered in prognosis and in treatment decisions for breast cancer is the 
presence or absence of the estrogen receptor (ER) in tumor samples. ER-positive 
breast cancers typically respond much more readily to hormonal Ifaer^ies such as 
tamoxifen, which acts as an anti-estrogen in breast tissue, than £R-negative tumors. 
Though useful, these analyses are subject to variability and provide only a very crude 
10 basis for tumor classification. Therefore, there exists a need for improved methods for 
classifying tumors. 

Mutation or dysregulation of any of a large number of genes contributes to the 
development and progression of cancer as discussed in Hanahan, D. and Weinberg, 
- R., The Hallmarics of Cancer, Cell, 100, 57-70, 2000. Genes that play a role in cancer 

15 can be divided into a number of broad classes including oncogenes, tumor suppressor 
genes, and genes that regulate apoptosis. Oncogenes such as ras typically encode 
proteins whose activities promote cellgrowth and/or division, a function that is 
necessary for normal physiological processes such as development, tissue 
regeneration, and wound healing. However, inappropriate activity or &ipicssion of 

20 oncogenes can lead to the uncontrolled cell proliferation tiiat is a feature of cancer. 
Tumor suppressor genes such as r& act as negative regulators of cell proliferation. 
Loss of their activity, e.g., due to mutations or decreased expression at the level of 
mRNA or protein, can lead to unrestrained cell division. A number of fiunilial cancer 
syndromes and inherited susceptibility to cancer are believed to be caused by 

25 mutations in tumor suppressor genes. Apoptosis, or programmed cell deafli, plays 
important roles both in normal development and in surveillance to eliminate cells 
whose survival may be deleterious to the organism, e.g., cells that have acquired DNA 
damage. Many chemotherapeutic agents are believed to work by activating the 
endogenous apoptosis pathway in tumor cells. 

30 Although a substantial number of genes have been implicated as playing 

important roles in cancer, the factors responsible for the phenotypic diversity of 
tumors remain largely unknown. In particular, understanding of the underlying 
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differences in gene expression that may contribute to tumor phenotype is limited. 

V^XiUVlOkUAXVAAU^ UAW UA J.A.V>1 WA^^O AAA ^^AlWf K.>/ypA VOOAV/Al UWWVWAA AAV/AAIACAA CiAA\J VC4J.1WWA 

tissue and between different tumors of the same tissue type is of significant 
diagnostic, prognostic, and therapeutic utility. There is therefore a need for the 
S identification of genes exhibiting difiTerential expression in tumors. In particular, 
there is a need for the identification of additional genes and protems that can be used 
to classify tumors, especially genes and proteins that can provide diagnostic, 
prognostic, and/or predictive information in cancer. There is also a need for 
antibodies and other reagents for the detection and measurement of such genes and 
10 proteins. 

Most of the commonly used chemotfaerapeutic agents act relatively 
nonselectively. Rather than specifically killing tumor oells» these agents tai^et any 
dividing cell, resulting in a variety of adverse effects. In addition, current therapeutic 
strategies are of limited efScacy, and the mortality rate of breast canc^ remains higji. 
IS There is therefore a need for the identification of additional genes and proteins that 
can be used as targets for the treatment of cancer. There is also a need for antibodies 
and other reagents that can modulate, regulate, or interact widi Ihese genes and 
proteins to provide new method of treatment for cancer. 

20 SUMMARY OF THE INVENTION 



Hie present invention relates to the identification of genes of particular import 
in diagnosis, prognostication and/or fhenqpeutic intervention in breast cancer and other 
tumors based on then- expression profile m human breast tumor samples, theur 
25 expression m other tissue and normal tissue samples, and in cell lines as assessed 
using cDNA microarrays. In particular, the genes are identified based on their 
differential expression across tumor samples. 

The invention provides a substantially purified polypeptide and fi'agments thereof that 
are encoded by an RNA molecule that is differentially expressed in human breast 
30 tumor samples and cell lines. The polypeptide is referred to herein as BSTP-Ras. The 
polypeptide is also referred to herein as BSlP-Ras/Rerg, or as Rerg (which stands for 
Ras-like estrogen-regulated growth-inhibitor gene), reflecting features and 
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characteristics of the polypeptide and the gene that encodes it as described more fully 
below. The gene that encodes this polypeptide is referred to herein as BST-RAS, BST- 
RAS/Rerg, oxRerg. 

Thus in one aspect the invention provides a substantially purified polypeptide 
5 whose amino acid sequence comprises the amino acid sequence set forth in SEQ ID 
N0:1. The invention also provides polypeptides possessing homology to the 
polypeptide having the sequence of SEQ ID N0:1 or to fragments of this polypeptide, 
wherein the polypeptides are significantly similar to the polypeptide of SEQ ID N0:1. 
In certain embodiments of the invention the definition of "significantly similar'" can 

10 vary, as described fiurther below. In certain embodiments of the mvention a 

significantly similar polypeptide has one or more amino acid substitutions, deletions, 
and/or additions with respect to the sequence of SEQ ID NO: 1 . In certain 
embodiments of the invention the polypeptides are expression products of human 
genes. When referring to polypeptides or polynucleotides whose sequence comprises 

15 the sequence set forth ui a SEQ ID NO:X, the set of polypeptides or polynucleotides 
includes those polypeptides or polynucleotides having the particular sequence set 
forth in the SEQ ID NO:X in addition to other polypeptides or polynucleotides 
including tfie sequence of SEQ ID NO:X. 

In mother aspect, die invention provides a substantially isolated and purified 

20 polynucleotide encoding the polypeptide of SEQ ID NO:L In particular, die invention 
provides a substantially isolated and purified polynucleotide comprising a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:2, SEQ ID 
N0:3, SEQ ID NO:4, or SEQ ID NO:5. The invention also provides a 
polynucleotide encoding a polypeptide possessing significant similarity to the 

25 polypeptide of SEQ ID NO: 1 , where "significantiy similar** is defined below. The 
invention fiirdier provides a polynucleotide having a sequence that is complementary 
to the sequence of SEQ ID NO:2, SEQ ID N0:3, SEQ ID NO:4, or SEQ ID N0:5. In 
addition, the invention provides a polynucleotide having a sequence that is 
complementary to a polypeptide encoding a polypeptide possessing significant 

30 similarity to the polypeptide of SEQ ID NO: 1 . 

In another aspect, the invention provides an isolated and purified 
polynucleotide that hybridizes under stringent conditions to a polynucleotide encoding 
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a polypeptide comprising or having the amino acid sequence set forth in SEQ ID 

XT/^.i -J.: 1 it-- ; J.: !j : — t^^^j i tx? _ j i_. ..i. 

jLNx^.i, jui pmii^uiai, uic uivciiiiuii piuviucs aii iduiaicu miu puniicu puiynuucoiiue 

that hybridizes under stringent conditions to a polynucleotide having the sequence set 
forth in SEQ ID N0:2, SEQ ID N0:3, SEQ ID N0:4, SEQ ID N0:5, or fragments of 
5 any of these three sequences. In certain embodiments of the mvention the 

polynucleotide encodes at least 10%, at least 20%, at least 30%, at least 40%, at least 
50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 
99% of the polypeptide comprising the amino acid sequence set forth in SEQ ID NO: 
1 .The invention fiutiier provides an isolated an purified polynucleotide that hybridizes 

10 under stringent conditions to a polynucleotide encoding a polypeptide having 

significant similarity to a polypeptide comprising or havmg the amino acid sequence 
set forth in SEQ ID NO: 1 . The invention also provides polynucleotides that hybridize 
to sequences as set forth above under moderately stringent conditions. 
In another aspect, the invention provides a substantially purified 

IS oligonucleotide that includes a region of nucleotide sequence that hybridizes to at 
least 8 consecutive nucleotides of sense or antisense sequence of a nucleotide 
sequence selected firom flie group consisting of SEQ ID NO:2, SEQ ID N0:3, SEQ ID 
NO:4, or SEQ ID NO:S. In certain embodunents of the invention Hie oligonucleotide 
is labeled, eg., with a fluorescent moiety, enzyme, enzyme substrate, or radioisotope, 

20 in order to fecilitate detection of the oligonucleotide. The oligonucleotide can be 
used, e.g., as a probe or a primer, to detect the level of a polynucleotide encodmg 
BSTP-Ras/Rerg m cells and/or tissues, e.g., tissues isolated fit)m a patient Hie 
oligonucleotide can also be used to detect mutations in the gene encoding BSTP- 
Ras/Rerg and/or to detect amplification or alt^^ expression of die gene encoding 

25 BSTP-Ras/R^g, in cells and/or tissues. In certain embodiments of the invention tiie 
oligonucleotide is attached to an oligonucleotide microarray. In certain embodiments 
of the invention the oligonucleotide has a sequence capable of binding specifically 
with an RNA molecule that encodes a polypeptide comprising an amino acid sequence 
set forth in SEQ ID NO: 1 so as to prevent appropriate processing, transport, or 

30 translation of the RNA molecule. 

In another aspect the invention provides vectors, e.g. plasmids, containing a 
polynucleotide encoding a polypeptide comprising the sequence of SEQ ID NO: 1 . 
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The invention also provides vectors, e.g., plasmids, comprising a polynucleotide 
encoding a polypeptide possessing significant similarity to the polypeptide of SEQ ID 
NO: 1 . In certain embodiments of the invention the polynucleotide comprises flie 
polynucleotide sequence of SEQ ID N0:2, SEQ ID NO:3, SEQ ID NO: 4, or SEQ ID 
S NO:S. In certain embodiments of the invention the vectors contain genetic control 
elements operably linked to the polynucleotide, wherein the genetic control elements 
direct transoiption of the polynucleotide, hi certain embodiments of the invention the 
vectors and genetic control elements are adapted for expression of the polynucleotide 
in a bacterial cell, a yeast cell, an msect cell, or a mammalian cell. The invention 

10 fiirther provides host cells, e.g., bacterial, yeast, insect, and mammalian cells 
containmg an expression vector containing the polynucleotide encoding the 
polypeptide having the sequence of SEQ ID NO: I . The invention fiirther provides 
host cells, e.g., bacterial, yeast, insect, and mammalian cells containing an expression 
vector comprising a polynucleotide encoding a polypeptide having significant 

15 similarity to the polypeptide of SEQ ID NO:l. 

In another aspect, the invention provides an antibody that specifically bind to a 
polypeptide whose sequ^ce comprises the ammo acid sequence of SEQ ID NO: 1 . 
The invention fiirdier provides antibodies that specifically bind to a polypeptide 
having significant similarity to the polypeptide of SEQ ID NO: 1 . The invention 

20 fiuther provides methods of detecting a polypeptide whose sequence comprises the 
amino acid sequence of SEQ ID NO: 1. The invention fiirther provides methods of 
detecting a polypeptide having significant similarity to the polypeptide of SEQ ID 
NO:l. The polypeptides can be detected in a variety of different contexts. For 
example, the polypeptides can be detected in lysates or extracts derived from cells or 

25 tissues, in culture medium, in substantially intact cells or tissue samples (e.g., biopsy 
specunens), and/or in the blood, urine, serum, ascites, or other body fluids or 
secretions of a subject 

In another aspect, the invention provides a nonhuman transgenic organism 
comprising a nonnative DNA molecule encoding a polypeptide comprising an amino 

30 acid sequence set forth m SEQ ID N0:1, or an ortholog of such a polypeptide, and 
including genetic control elements sufiBcient to direct transcription of the DNA 
molecule in at least a subset of the organism's cells. The invention further provides a 
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nonhuman transgenic organism comprising a nonnative DNA molecule encoding a 
polypeptide having significant similarity to the polypeptide of SEQ ID N0:1 and 
including genetic control elements sufficient to direct transcription of the DNA 
molecule m at least a subset of the oi:ganism's cells. In yet another aspect, the 
S invention provides a nonhuman organism in which a native DNA sequence encoding a 
polypeptide having significant similarity to the polypeptide of SEQ ID NO: 1 is 
deleted, e.g., by homologous recombmation. 

In another aspect, the invention provides methods for detectmg a 
polynucleotide encoding a polypeptide comprising an amino acid sequence set forth ui 

10 SEQ ID N0:1, or a polypeptide having significant similarity to the polypeptide of 
SEQ ID N0:1, in a biological sample. The sample may comprise cells, tissue, body 
fluid, etc., isolated firom a subject. The sample may be processed in a variety of ways 
prior to application of the methods. For example, the sample may be subjected to 
purification, reverse transcription, amplification, etc. One such method comprises 

IS steps of: (a) hybridizing a nucleic acid complementary to the polynucleotide encoding 
a polypeptide comprising an amino acid sequence set forth in SEQ ID N0:1, or a 
polypqptide having significant similarity to the polypeptide of SEQ ID NOrl, to at 
least one nucleic acid in the biological sample, thereby forming a hybridization 
complex; and G>) detecting the hybridization complex, wherem the presence of the 

20 hybridization complex mdicales the presence of a polynucleotide encoding the 

polypeptide m the biological sample. A second such method comprises steps of: (a) 
hybridizing a nucleic acid encoding a polypeptide comprising an amino acid sequence 
set forth in SEQ ID NO: 1, or a polypeptide having significant similarity to the 
polypeptide of SEQ ID N0:1, to at least one nucleic acid complementary to at least 

25 one nucleic acid in the biological sample, thereby forming a hybridization comply; 
and (b) detectmg Ifae hybridization complex, whmin the presence of the hybridization 
complex indicates the presence of a polynucleotide encoding the polypeptide m the 
biological sample. 

In yet another aspect, the invention provides kits for detecting a polynucleotide 
30 encoding a polypeptide comprising an ammo acid sequence set fi)rfh in SEQ ID NO:l, 
or a polypeptide havmg significant similarity to the polypeptide of SEQ ID NOil . 
The kits can comprise a polynucleotide that hybridizes specifically to the 
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polynucleotide encoding a polypeptide comprising an amino acid sequence set forth in 
SEQ ID NOrl, or a polypeptide having significant similarity to the polypeptide of 
SEQ E) N0:1 and, optionally, other materials such as suitable buffers, indicators 
(e.g., fluorophores, chromophores or enzymes providing same), controls (e.g., an 
S appropriate polynucleotide of this invention), controls, and directions for using the kit. 

In another aspect, the mvention provides methods for detecting a polypeptide 
comprising an amino acid sequence set forth in SEQ ID NO:l, or a polypeptide 
having significant similarity to tfie polypeptide of SEQ ID NO: 1 . One such method 

10 comprises steps of: (a) contacting the biological sample with an antibody that 
specifically binds to the polypeptide of SEQ ID NO: 1, or a polypeptide having 
significant similarity to the polypeptide of SEQ ID NO: 1 ; and (b) determinmg 
whether the antibody specifically bmds to the sample, the binding being an indication 
that the sample contains the polypeptide. The invention also provides kits for 

IS performing these methods. The kits can comprise an antibody (preferably a 

monoclonal antibody) that binds to the polypeptide and, optionally, otiier materials 
such as suitable buffers, indicators (e.g., fluorophores, chromophores or enzymes 
providing same), controls (e.g., a polypeptide of this invention) and du^ons for 
usuig the kit. 

20 In another aspect, the invention provides methods for producing a polypeptide 

comprising the ammo acid sequence of SEQ ID NO: 1 , or a polypeptide having 
significant similarity to the polypeptide of SEQ ID N0:1. In one embodunent the 
method mchides the steps of providing a cell that expresses the polypeptide or 
polypeptide fiagment, e.g., a cell containmg an expression vector containmg a 

25 polynucleotide encoding the polypeptide or polypeptide fi:agment operably linked to 
genetic control elements that direct transcription of the polynucleotide; mamtaming 
the cell under conditions wherein the polypeptide is produced; harvesting the 
polypeptide fix)m the cell and/or the culture medium; and, optionally, purifying the 
polypeptide. 

30 In another aspect, the invention provides methods of predicting whether an 

individual is at risk for a condition featuring inappropriate cell division, e.g., cancer. 
One such metiiod comprises the step of determining whether there exists, withm a cell 



wo 02/08282 



PCT/USOl/23574 



and/or tissue of an individual, a mutation in a gene encoding a polypeptide having an 
amino acid sequence sdected Erutn ihe group corisisllng of SEQ ID NO:!, or a 
polypeptide having significant similarity to the polypeptide of SEQ ID NO: I, or 
whether there exists, within a cell and/or tissue of an individual, a mutation in a 
S regulatory sequence for such a gene. A second such method comprises the step of 
deteraiming whether an individual expresses a particular allele or variant of a gene 
comprising a nucleotide sequence set forth in SEQ ID NO:2, SEQ ID N0:3, SEQ ID 
NO:4, SEQ ID N0:5, such allele or variant being present within the general 
population and inherited from either of the individuates parents rather than 

1 0 constituting a de novo mutation within the organism. A third such method comprises 
the step of determming whether there exists^ within a cell and/or tissue of an 
individual, inappropriate expression of a polynucleotide encoding a polypeptide 
having an ammo acid sequence set forth in SEQ ID NO:l or a polypeptide having 
significant similarity to the polypeptide of SEQ ID NO:L A fourth such method 

1 S comprises the step of determining whether there exists, within a cell and/or tissue of 
. an individual, mappropriate expression of a polypeptide havmg an ammo acid 
sequence set forth in SEQ ID NO:l or a polypeptide having significant similarity to 
tile polypeptide of SEQ ID NO:l . 

Li another aspect, the invention provides methods of classifying a disease, 

20 particularly of classifying tumors. One such method includes steps of: (a) obtaining 
cells or tissue fi-om a site of disease; (b) detecting a polynucleotide encodmg a 
polypeptide having a sequence selected fi-om the group consistmg of SEQ ID N0:1 or 
a polypq)tide having significant similarity to tiie polypeptide of SEQ ID NOrl, or a 
complement of such a polynucleotide; and (c) assigning tiie disease to one of a set of 

25 predetermined categories based on detection of the polynucleotide. The method can 
further comprise the step of providing diagnostic, prognostic, or predictive 
information based on the categoxy assigned in the assigning step. 

In certain embodiments of the invention the method is used to classify breast 
tumors. In certain embodiments of the invention the method is based on detecting 

30 expression of a smgle gene, e.g., a gene encoding the polypeptide of SEQ ID NOrl. 
Detecting expression of such a gene may comprise detecting the polynucleotide 
sequence set forth in SEQ ID N0:2, SEQ ID N0:3, SEQ ID NO:4, or SEQ ID NO:5. 
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Detecting expression may comprise measuring either a relative or absolute level of 
expression. In certain embodiments of the invention the method is based on an 
assessment of the expression of multiple polynucleotides as described fiirther in 
copending U.S. Patent Application ''Reagents and Methods for Use in Managing 
5 Breast Cancer, filed July 26, 2001 and in U.S. Provisional Patent Application Sen No. 
60/220,967, filed July 26, 2000. These applications are referred to herein as 'the gene 
subset application". The multiple polynucleotides can include all of Ifae 
polynucleotides disclosed therein or any subset thereof. 

In another aspect, the invention provides a method of classifying a tumor by 

10 detection of die polypeptide of SEQ ID NO: 1 or a polypeptide having significant 
similarity to the polypeptide of SEQ ID N0:1 m cells and/or tissue samples obtained 
6om the tumor or fit)m elsewhere in the body (e.g,, in blood, urine, ascites, other 
body fluids or secretions or excretions). In a preferred embodiment the method is 
used to classify breast tumors. In certain embodiments the method is based on the 

1 5 detection of a single polypeptide, e.g., the polypeptide of SEQ ID NO: 1 or a 
polypeptide havmg significant similarity to the polypeptide of SEQ ID NO: 1 . 
However, in other embodiments the method is based on an assessment of the 
expression of multiple polypeptides. The multiple polypeptides can include all of the 
polypeptides disclosed in the gene subset application mentioned above or any subset 

20 . thereof. In c^tain embodiments the level of egression (either relative; or absolute) of 
the polypeptide is measured. In other embodiments the pattern of expression of the 
polypeptide withm cells or withm a tissue sample is assessed. Regardless of the 
method by which a tumor is assigned to a predetermmed category, the assignment 
may be used as a basis to provide diagnostic, prognostic, and/or predictive 

25 information to the patient having the tumor. 

In another aspect, the mvention provides a microarray for use m classifymg 
tumors, comprising a polynucleotide whose sequence comprises or is complementary 
to that set forth in SEQ ID N0:2, SEQ ID N0:3, SEQ ID N0:4, or SEQ ID N0:5, or 
whose sequence is of sufiScient length to specifically bind to such a sequence under 

30 the microarray hybridization conditions employed. Such conditions may be those 
described in the Examples, or any conditions appropriate for the particular microarray 
and detection technology employed. The invention further provides a microarray for 
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use in classifying tumors comprising a polynucleotide, e.g., a cDNA or an 
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polypeptide having the amino acid sequence set forth in SEQ ID NO: 1, or capable of 
binding specifically to a polypeptide having significant similarity to the polypeptide 
5 of SEQ ID NO: 1, or complementary to such polynucleotides. 

In another aspect, the invention provides a pharmaceutical composition 
comprising a substantially purified polypeptide having the amino acid sequence set 
forth in SEQ ED N0:1 or a polypeptide having significant similarity to the polypeptide 
of SEQ ED NO: 1 . In certain embodiments of the mvention the pharmaceutical 

10 composition fiirther preferably comprises a pharmaceutically acceptable carrier. 

In yet another aspect, tiie invention provides a pharmaceutical composition 
comprising a substantially purified antibody that binds to a polypeptide having an 
amino acid sequence set forth in SEQ ID NO: 1 or a polypeptide having significant 
similarity to the polypeptide of SEQ ID NO: L In certain embodiments of the 

1 S mvention the pharmaceutical composition fiirther preferably comprises a 

pharmaceutically acceptable carrier. In certain embodiments the antibody is modified, 
e.g., by attaching a toxic molecule thereto. 

The invention provides methods for identifying modulators of the expression 
or activity of a polypeptide comprising the amino acid sequence set forth in SEQ ID 

20 NO: 1 or a polypeptide having significant similarity to the polypeptide of SEQ ID 
NO: 1 . Tlie invention further provides agonists and antagonists capable of modulating 
the expression or activity of a polypeptide comprising the ammo acid sequence set 
forth m SEQ ID NO: 1 or a polypeptide having significant similarity to the polypeptide 
of SEQ ID NO: 1 . The invmtion provides pharmaceutical compositions mcluding 

25 such modulators and methods of use thereof for the treatment or prevention of cancer, 
particularly breast cancor. 

In another aspect the mvention provides a method for the treatment or 
prevention of cancer comprising the step of administermg to an individual in need 
thereof, a pharmaceutical composition comprising a polypeptide havmg an ammo acid 

30 sequence comprising the sequence of SEQ ID NO: 1 or a polypeptide having 

significant similarity to the polypeptide of SEQ ID N0:1 . In another aspect, the 
invention provides a method for the treatment or prevention of cancer comprising the 
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Step of administering to an individual in need thereof, a pharmaceutical composition 
comprising an antibody or a modified antibody that bmds to a polypeptide having an 
amino acid sequence set forth in SEQ ID NO: I, or a polypeptide having significant 
similarity to the polypeptide of SEQ ID NO: 1 . 
5 In another aspect, the invention provides methods of screening for an inhibitor 

of cell grov^. The methods mvolve identifying a compound tiiat enhances the 
expression or activity of a polypeptide comprising the amino acid sequ^ce of SEQ 
ID N0:1 or a polypeptide having significant similarity to the polypeptide of SEQ ID 
N0:1. According to one embodiment, the method includes steps of: (1) contacting a 

10 polypeptide comprising the amino acid sequence of SEQ ID NO:l or a polypeptide, 
having significant similarity to the polypeptide of SEQ ID NO: 1 with a test 
compound; (2) measuring GTP-binding and/or GTPase activity of the polypeptide; 
and (3) identifymg the test compound as an activator of the polypeptide if the 
polypeptide exhibits enhanced GTP-binding activity and/or reduced GTPase activity 

13 in the presence of the test compound relative to the level of GTP-bindmg activity 
and/or GTPase activity in the absence of the test compound. In certain embodunents 
of the method the polypeptide bicludes a GlP-binding domain. The method may be 
performed m vitro, e.g., using a polypeptide produced using recombinant DNA 
technology, or may be performed in vivo, e.g., by applying the test compound to a cell 

20 that expresses die polypeptide and isolating the polypeptide &om die cell. In certain 
of the methods the polypq)tide may be a variant that includes a tag e.g., in order to 
facilitate isolation. 

In another aspect the invention provides methods of inhibiting growth of a 
cell comprising dancing die level or activity of a polypeptide comprising the amino 

25 acid sequence of SEQ ID NO: 1 or a polypeptide having significant similarity to die 
polypeptide of SEQ ID NO: 1 in the cell. The cell can be a normal cell or a tumor cell, 
e.g., a breast tumor cell. According to certain of the methods the level of the 
polypeptide is enhanced by overexpressing the polypeptide in ttie cell, e.g., by 
introducmg an expression vector or other nucleotide sequence (e.g., DNA, RNA, or 

30 modified nucleotides, etc.) that encodes the polypeptide into die cell. According to 
certain of the methods the activity of the polypeptide is enhanced by contactmg the 
cell with a compound that either directly or indirectly increases the GTP-binding 

12 
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activity of the polypeptide and/or reduces the GTPase activity of the polypeptide. 
Such a compound may foe identified using the inventive screening method described 
above. According to certain of the methods the activity of the polypeptide is 
enhanced by making an q)propriate mutation in the coding sequence, such that tiie 
S resulting variant displays increased GTP-binding activity and/or reduced GTPase 
activity. As discussed below, a number of such mutations m Ras femily members that 
have predictable effects on GTP-binding activity and/or GTPase activity are well 
knovm in Ifae art. 

In another aspect, the invention provides a method of treating or preventing a 

1 0 tumor comprismg steps of: (i) providing an mdividual in need of treatment or 
prevention of a tumor, (ii) administering a compound that enhances the level or 
activity of a polypeptide comprisuig the ammo acid sequence of SEQ ID NO:l or a 
polypeptide having significant similarity to the polypeptide of SEQ ID NO: 1 . In 
certain embodiments of the method the compound is identified using the screening 

IS assay described above. In certain embodiments of the invention the compound is 
provided as a component of a pharmaceutical composition. The invention also 
includes such a pharmaceutical composition. 

In another aspect, the invention provides a melliod of screening fi)r an 
activator of cell growth. The methods involve identifying a compound that reduces or 

20 inhibits the expression or activity of a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 1 or a polypeptide having significant sunilarity to the 
polypeptide of SEQ ID NOrl . According to one embodim^t, the method includes 
steps of: (1) contacting a polypqitide comprising the amino acid sequence of SEQ ID 
NO:l or a polypeptide having significant similarity to the polypeptide of SEQ ID 

25 NO:l with a test compound; (2) measuring GTP-binding and/or GTPase activity of 
the polypeptide; and (3) identifying the test compound as an inhibitor of the 
polypeptide if the polypeptide exhibits decreased GTP-binding activity and/or 
increased GTPase activity in the presence of the test compound relative to the level of 
GTP-binding activity and/or GTPase activity in the absence of the test compound. In 

30 certain embodiments of the method the polypeptide includes a GTP-binding domain. 
The method may be performed in vitro, e.g., using a polypeptide produced using 
recombinant DNA technology, or may be performed in vivo, e.g., by applying the test 
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compound to a cell that expresses the polypeptide and isolating the polypeptide fiom 
the ceil. In certain of the methods the polypeptide may be a variant that includes a 
tag, G,g., in order to facilitate isolation. 

In another aspect, the mvention provides a method of increasing cell growth 
S comprising: 

decreasing the level or activity of a polypeptide comprising the amino acid sequence 
of SEQ ID NO: 1 or a polypeptide having significant similarity to the polypeptide of 
SEQ ID N0:1 in the cell* Various methods for decreasing the level of a polypeptide 
in a cell are known in the art. Such methods include, for example, the introduction or 

1 0 expression of antisense nucleic acids or double-stranded RNA (RNA-mediated 
interference) into the cell. Another such method is introduction or e^qpression of 
dominant negative polypeptides into the cell. According to certain of the methods tfie 
activity of the polypeptide is decreased by contacting the cell with a compound that 
either directly or indurectly decreases the GTP-buiding activity of the polypeptide 

IS and/or increases the GTPase activity of the polypeptide. Such a compound may be 
identified using the inventive screening method described above. According to 
certain of the methods the activity of the polypeptide is reduced by making an 
appropriate mutation in the coding sequence, such that the resulting variant displays 
decreased GTP-binding activity and/or increased GTPase activity. 

20 According to one aspect, the mvention provides a method of classifying a 

tumor comprising the steps of (i) providing a tumor sample, (ii) detecting expression 
or activity of a gene encoding the polypeptide of SEQ ID N0:1 in the sample; and (iii) 
classifying the tumor as belonging to a tumor subclass based on the results of the 
detecting step. The detecting step may comprise detecting the polypeptide. A variety 

25 of detection techniques may be employed including, but not limited to, 
immunohistochemical analysis, ELISA assay, antibody arrays, or detecting 
modification of a substrate by the polypeptide. 

In certam embodiments of the methods the tumor is a breast tumor and the 
tumor subclass is a luminal tumor subclass. The methods may further comprise 

30 providing diagnostic, prognostic, or predictive information based on the classifying 
step. Classifying may include stratifying the tumor (and thus stratifying a subject 
having the tumor), e.g., for a clinical trial. The methods may further comprise 
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selecting a treatment based on the classifying step. In clinical research, stratification is 
the process or result of describing or separatmg a patient population mto more 
homogeneous subpopulations accordmg to specified criteria. Stratifymg patients 
initially rather than after the trial is frequently preferred, e.g.» by regulatory agencies 
5 such as the U.S. Food and Drug Administration that may be mvolved in the approval 
process for a medication. In some cases stratification may be required by the study 
design. Various stratification criteria may be employed in conjunction with detection 
of expression of one or more basal marker genes. Commonly used criteria include 
age, &nily history, lymph node status, tumor size, tumor grade, etc. Otfier criteria 

1 0 including, but not limited to, tumor aggressiveness, prior therapy received by the 

patient, ER and/or PR positivity, Her2neu status, pS3 status, various other biomarkers, 
etc., may also be used. Stratification is fi^quently useful in performing statistical 
analysis of the results of atrial. 

In another aspect, the invention provides a method of testing a subject 

IS comprismg the steps of (i) providmg a sample isolated fi'om a subject, (ii) detecting 
expression or activity of a gene encodmg the polypeptide of SEQ ID NO: 1 in the 
sample, and (iii) providing diagnostic, prognostic, or predictive information based on 
the detectuig step. The detecting step may comprise detecting the polypeptide. 
Detection may be performed using any appropriate technique including, but not 

20 limited to, immunohistochemistry, ELISA assay, protein array, or detecting 
modification of a substrate by the polypeptide. 

The sample may comprise mRNA, in which case the detecting step may 
comprise hybridizing the mRNA or cDNA or RNA synthesized from the mRNA to a 
microarray or detecting mRNA transcribed from the gene or detecting cDNA or RNA 

25 synthesized from mRNA transcribed from the gene. In any of the above mediods, the 
sample may be a blood sample, a urine sample, a serum sample, an ascites sample, a 
saliva sample, a cell, and a portion of tissue. 

According to another aspect, the invention provides a method of testing a 
compound or a combination of compounds for activity against tumors comprising 

30 steps of (i) obtaining or providing tumor samples taken from subjects who have been 
treated with the compound or combination of compounds, wherem the tumors fall' 
within a tumor subclass, (ii) comparing the response rate of tumors that fall within the 
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tumor subclass and have been treated with the compound with the overall response 
rate of tumors that have been treated with the compound or combination of 
compounds or with the response rate of tumors that do not &1I within the subclass and 
have been treated with the compound or combination of compounds and (iii) 
S identifying the compound or combination of compounds as having selective activity 
against tumors in the tumor subclass if the response rate of tumors in the subclass is 
greater than the overall response rate or die response rate of tumors that do not fall . 
within the subclass. In certain embodiments of the invention the tumors are breast 
tumors. In certain embodiments of the invention tfie tumor subclass is a luminal 

10 tumor subclass. The tumors may be classified according to any of the inventive 
classification methods described above. In certain embodiments of the invention the 
classification is based on expression of the polypeptide of SEQ ID NO: 1 . 

The invention fiirther provides a metiiod of testing a compound or a 
combination of compounds for activity against tumors comprising steps of (i) treatiiig 

1 5 subjects in need of treatment for tumors with tiie compound or combination of 
compounds, (ii) comparing the response rate of tumors that &11 within a tumor 
subclass with the overall response rate of tumors or with the response rate of tumors 
that do not Ml within tiie subclass, and (iii) identifying the compound or combination 
of compounds as having selective activity agsunst tumors in the tumor subclass if the 

20 response rate of tumors in the subclass is greater than the overall response rate or the 
response rate of tumors that do not fall within the subclass. The method may further 
comprise various additional steps. For example, tlie method may comprise steps of (i) 
providing tumor samples from subjects in need of treatment for tumors, (ii) 
determining whether the tumors fall within a tumor subclass, and (iii) stratifying the 

25 subjects based on the results of the determining step prior to performing the treating 
step. The method may further comprise the steps of (i) providing tumor samples from 
subjects in need of treatment for tumors, (ii) detecting expression or activity of a gene 
encoding the polypeptide of SEQ ED N0:1 in the samples, and (iii) stratifying the 
subjects based on the results of the detecting step prior to performmg the treating step. 

30 

In addition, the invention includes a method of testing a compound or a 
combination of compounds for activity against tumors comprising steps of (i) treating 
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subjects in need of treatment for tumors with the compound or combination of 
compounds or with an alternate compound, wherein the tumors faii within a tumor : 
subclass, (ii) comparing the response rate of tumors treated with the compound or 
combination of compounds with the response rate of tumors treated with the alternate 
5 compound; and (iii) identifying the compound or combmation of compounds as 
havmg superior activity agamst tumors in the tumor subclass, as compared with the 
alternate compound, if the response rate of tumors treated with the compound or . 
combination of compounds is greater than the response rate of tumors treated with ftte 
alternate compound. The method may further comprise various additional steps. For 
1 0 example, the method may comprise steps of (i) providing tumor samples fiom 

subjects in need of treatment for tumors, (ii) determining whether the tumoi^ &11 . 
within a tumor subclass, and (iii) stratifying the subjects based on the results of the : 
determining step prior to performing fte treating step. The method may further 
comprise the steps of (i) providmg tumor samples j&om subjects in need of treatment 
1 S for tumors, (ii) detecting e>q}ression or activity of a gene encoding the polypeptide of 
SEQ ID N0:1 in the samples, and (iii) stratifying the subjects based on the results of 
the detecting step prior to performing the treating step. 

In certain embodiments of the invention the alternate compound is a 
compound approved by tfie U.S. Food and Drug admmistration for treatment of 
20 tumors. The invention also provides a method of treating a subject comprising steps 
of (i) identifydng a subject as having a tumor in a luminal tumor subclass, and (ii) 
administering to the subject a compound identified according to any of the inventive 
mediods for identifybig a subject 

The present application refers to various issued and pending patent 
2S applications, provisional patent applications, published articles, books, and other 
publications. The contents of all of these items are herein incorporated by reference. 
The present application also incorporates by reference six U.S. patent applications 
filed by inventors on July 26, 2001 . These applications are entitled "REAGENTS 
AND METHODS FOR USE IN MANAGING BREAST CANCER", "BSIP-ECGI 
30 PROTEIN AND RELATED REAGENTS AND METHODS OF USE THEREOF", 
*BSTP-CAD PROTEIN AND RELATED REAGENTS AND METHODS OF USE 
THEREOF", "BASAL CELL MARKERS IN BREAST CANCER AND USES 
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THEREOF", "BSTP-TRANS PROTEIN AND RELATED REAGENTS AND 
METHODS OF USE THEREOF", "BSTP-5 PROTEINS AND RELATED 
REAGENTS AND METHODS OF USE THEREOF". 

5 BRIEF DESCRIPTION OF THE DRAWING 

Figure 1 presents the sequence of BSTP-Ras/Rerg (SEQ IDN0:1) and the sequences 
of polynucleotides that include sequ^ces that encode BSTP-Ras/Rerg (SEQ ID 
NO:2, SEQ ID N0:3, and SEQ ID NO:4). SEQ ID NO:2 is a 599 nucleotide open 
10 reading frame that encodes BSTP-Ras/Rerg. The open reading fiame is followed by 
an in-frame termination codon (TAG). In SEQ ID N0:3 and SEQ ID N0:4, the 
character '^N" stands for any nucleotide. 

Figure 2 presents the sequence of a full length cDNA that encodes the polypeptide of 
15 SEQ ID N0:1, i.e., BSTP-Ras/Reig. The sequence is designated as SEQ ID N0:5. 

Figure 3 presents a portion of the results of a search performed in GenBank using 
BLASTP with the amino acid sequence of BSTP-Ras/Rerg (SEQ ID NO: 1) as input 
The figure identifies polypeptides having homology to BSTP-Ras/Rerg. 

20 

Figure 4 presents a Northern blot showing expression of BST-RAS/RERG mKNA in 
various ceil lines. 

Figure S presents a summaiy of estrogen modulation microarray experhnents showmg 
25 effects of estrogen or tamoxifen treatment or estrogen deprivation on expression of 
BST'RAS/RERG and various other genes. 

Figure 6 presents an alignment of the human BSTP-Ras/Rerg amino acid sequence 
with the amino acid sequences of various other Ras fiunily members. 

30 
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Figure 7A presents a Western blot showing expression of GFP or a GFP-Rerg fusion 
GFP or GFP-Rerg. 

Figure 7B presents an epifluorescence micrograph showing expression of GFP or a 
GFP-Rerg fusion protein in living cells transiently transfected with DNA constructs 
encoding GFP or GFP-Rerg. 



Figure 8A presents a Western blot showing expression of HA-Rerg in MCF-7 cells 
stably expressing an empty vector and in clones overexpressing HA-Rerg (MCF-7 
1 0 stably transfected clones). 

Figure 8B presents growth curves for MCF-7 cells stably expressing empty vector and 
for HA-Rerg overe^qpressing clones. 

Figure 8C presents a graph showing reduced growdi m soft agar of HA-Rerg 
expressing clones relative to MCF-7 cells stably expressing empty vector. 
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BRIEF DESCRjQTlON OF THE TABLES 
The tables contain the numerical data corresponding to microarray images. Some 
table present additional mformation or list the individual genes in die various gene 
subsets. 



Table 1 is a master data table for the 65 microarray e?cperiments performed on 
individual tumor samples, in which rows represent I.M.A.G.E. clones that identify 
approximately 1753 genes whose expression varied by at least a fector of 4 and 
columns represent mdividual microarray experiments. The first 50 pages of the table 

25 consist of a reference list in which a descriptive name for each clone (where such a 
name exists) appears in the column entitled Name, followed by the Genbank 
accession number for the clone. Each row in the reference list contains a number in 
the first column that nummcally identifies the colunm. In the subsequent data portion 
of the table (pages 1 - 392), each row is similarly identified by a number in the first 

30 column so that the name and Genbank accession number for the clone for which data 
q)pears in that row may be determmed by consulting the reference list. In the data 
portion of the table, the column headings in the first row identify the tumor samples. 
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Each data cell in the table represents the measured Cy5/Cy3 fluorescence ratio at the 
corresponding target element on the appropriate array. Empty cells indicate 
insufficient or missing data. All ratio values are log transformed (base 2) to treat 
inductions or repressions of identical magnitude as numerically equal but with 
S opposite sign. 

Table 2 is a master data table for the 19 microaFray experiments performed on cell line 
samples, in whidi rows represent LM.A.G.E. clones that identify approximately. 1753 
genes whose expression varied by at least a factor of 4 and columns represent 

10 individual microarray e^qseriments. This table contains only a data portion, in which 
the colunm headings in the first row identify the cell lines. Each row in the table is 
identified by a number which appears in fte first column. The same reference list that 
forms part of Table 1 may be consulted to determine the name and Genbank accession 
number for the clone for which data appears in diat row. Each data cell in the table 

1 S represents the measured Cy S/Cy3 fluorescence ratio at the corresponding target 
element on the appropriate array. Empty cells indicate insufiScient or missing data. 
All ratio values are log transformed (base 2) to treat mductions or repressions of 
identical magnitude as numerically equal but with opposite sign. 

20 Table 3 presents a listing and description of the 1 1 cell lines used to create the ^ 
common reference sample. 

Table 4 presents a complete listing of the 84 experimental samples that were assayed 
versus the common reference sample. The table mcludes a list of alternate names (in 
25 the column entitled Sample ID/old name) for the same tumors. The alternate names 
are used to identify the tumor samples in certain contexts, and &e table allows 
conversion between the two sets of names. 

Table 5 lists the tumors used in the experiments described herein, along with clinical 
30 and pathological information about each tumor/patient. 

Table 6 is a master data table for the 84 microarray experiments performed on 
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individual tumor, tissue, and cell line samples, in which rows represent 
clones that identify the 496 genes in the intrinsic gene set, and columns represent 
individual microarray experiments. The &st 15 pages of the table consist of a 
reference list in which a descriptive name for each clone (where such a name exists) 
5 appears in the column entitled Name, followed by the Genbank accession number for 
the clone. Each row in the reference list contains a number in the first column that 
numerically identifies the column. In the subsequent data portion of the table (pages 1 
- 91), each row is similarly identified by a number in the first column so that the 
name and Genbank accession number for fiie clone for which data appears in that row 

10 may be determined by consulting the reference list. In the data portion of die table, 
the column headings in the fffst row identify the tumor samples. Each data cell in the 
table represents the measured Cy5/Cy3 fluorescence ratio at tihie corresponding target 
element on the appropriate array. Empty cells indicate insufiBcient or missing data. 
All ratio values are log transformed (base 2) to treat inductions or repressions of 

1 5 identical magnitude as numerically equal but with opposite sign. 

Table 7 is a listing of the 374 clones feat identify genes selected for the epifeelial 
enriched gene set includmg Genbank accession numbers. 

20 Tables is a listing of fee clones feat identify genes feat comprise fee luminal subset 
including Genbank accession numbers. 

Tables 9-1 and 9-2 are listings of fee two groups of clones feat identify genes feat 
comprise fee basal subset including Genbank accession numbers. 

25 

Table 10 is a listing of fee clones that identify genes feat comprise fee ErbB2 subset 
includuig Genbank accession numbers. 

Table 1 1 is a listing of the clones feat identify genes that comprise fee endofeelial 
30 gene subset including Genbank accession numbers. 

Table 12 is a listmg of the clones feat identify genes feat comprise fee 
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Stromal/fibroblast gene subset including Genbank accession numbers. 

Table 13 is a listing of the clones that identify genes that comprise the B-cell gene 
subset including Genbank accession numbers. 

5 

Table 14 is a listing of tiie clones that identify genes that comprise the adipose- 
enriched/normal breast gene subset including Genbank accession numbers. 

Table IS is a listing of the clones that identify genes that comprise the macrophage 
1 0 gene subset mcluding Genbank accession numbers. 

Table 16 is a listing of the clones that identify genes that comprise the T-cell gene 
subset including Genbank accession numbers. 

IS In Table 1» the Genbank accession number for each clone appears in the column 
entitled •'Name", following a brief descriptive name for the gene identified by the 
clone, where available. In some cases the descriptive name is a number corresponding 
to an I.M.A.G£. clone ID mmiber. As is well known and accepted in the art, the 
Genbank accession number represents a means of definitively identifydbg a particular 

20 clone, since Genbank accession numbers will be maintained permanently or, if 

changed, the.change will be accomplished in such a manner as to allow unambiguous 
correlation between any new numbering system and the numbering system currently 
in use. 

Note that Tables 1, 2, and 6 are provided for purposes of presenting the clone 
2S identifications and the data that was used to perform hierarchical clustering analysis^ 
and tiiat the format of the tables may not correspond exactly with the format required 
by software developed for the analysis of the data. Appropriate format will, in 
general, depend upon the particular computer program. See, for example, the Web 
site http://genome-www.stanford.edu/~sherlock/tutoriaI.html for discussion of the 
30 appropriate format for one particular analysis program. 

In Tables 7-16, each entry identifies a clone. The first portion of each entry is a 
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brief descriptive name for the gene identified by the clone. The Genbank accession 
number &r the clcna appears Ou tlie last lliic of the ciiiry for iiiat clone. 

DETAILED DESCRIPTION OF CERTAIN EMBODIMENTS 

5 

DEFINraONS 

To fecilitate understanding of the description of the invention, the foUowmg 
definitions are provided. It is to be understood that, in general, terms not otherwise 
1 0 defined are to be given their meaning or meanings as generally accepted in the art 

Agonist: As used herein, the term "agonist" refers to a molecule that increases or 
prolongs the duration of the effect of a polypeptide or a nucleic acid. Agonists may 
include proteins, nucleic acids, carbohydrates, lipids, small molecules, ions, or any 

IS other molecules that modulate the effect of the polypeptide or nucleic acid. An 

agonist may be a dkect agonist, in which case it is a molecule that exerts its effect by 
bmding to Ae polypeptide or nucleic acid, or an indirect agonist in which case it 
exerts its effect via a mechanism other than binding to the polypeptide or nucleic acid 
by altering expression or stability of the polypeptide or nucleic acid, by altermg 

20 the expression or activity of a target of the polypeptide or nucleic acid, by mteracting 
with an intermediate in a pathway mvolvmg the polypeptide or nucleic acid, etc.) 

Antagonist: As used herein, the term "antagonist" refers to a molecule that decreases 
or reduces the duration of tiie effect of a polypeptide or a nucleic acid. Antagonists 

25 may include proteins, nucleic acids, carbohydrates, or any other molecules that 

modulate the effect of the polypeptide or nucleic acid. An antagonist may be a dfrect 
antagonist, in which case it is a molecule that exerts its effect by binding to the 
pol3^ptide or nucleic acid, or an mdhect antagonist, in which case it exerts its effect 
via a mechanism other than bindmg to the polypeptide or nucleic acid (e.g., by 

30 altering expression or stability of the polypeptide or nucleic acid, by altermg the 

expression or activity of a target of the polypeptide or nucleic acid, by interacting with 
an intermediate in a pathway involving the polypeptide or nucleic acid, etc.) 

23 
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Corresponding to: In general, the phrase "corresponding to" has its commonly 
accepted meaning indicating, typically, a relationship between two entities, etc. For 
example, a mRNA corresponds to a gene if the mKNA is transcribed from the gene. 
S A protein corresponds to a gene if the protein is translated from an mSNA transcribed 
from the gene. A cDNA corresponds to an mRNA if the cDNA is synthesized by 
reverse transcription of the mRNA. In addition, and without limitation, as used 
herein, an mRNA corresponds to a clone on a microarray when the mRNA (or cDNA 
derived therefrom) hybridizes specifically (under the e)q[)erimental conditions 

10 described) to the clone or to its complement, e.g., when the sequence of the mRNA 
(or cDNA derived therefrom) and the sequence of the clone are sufBciently 
complementary to one another for specific hybridization to occur. Sunilarly, a gene 
corresponds to a clone on a microarray when mRNA transcribed from the gene 
corresponds to the clone. Note that it is not necessary that the entire mRNA, cDNA, 

15 etc. hybridize with the clone or vice versa. For example, the niRNA or cDNA may be 
longer or shorter than the clone. The clone may be longer or shorter than the mRNA 
or cDNA. Eitiier or both of the mRNA/cDNA and the clone may contain one or more 
stretches of sequence that is/are not contained within the corresponding nucleic acid. 

20 Diagnostic information: As used herein, diagnostic information or information for 
use in diagnosis is any mformation that is usefiil in determining whether a patient has 
a disease or condition and/or in classifying the disease or condition into a phenotypic 
category or any category having significance with regards to the prognosis of or likely 
response to treatment (eitiber treatment in general or any particular treatment) of the 

25 disease or condition. Similarly, diagnosis refers to providing any type of diagnostic 
information, including, but not limited to, whether a subject is likely to have a 
condition (such as a tumor), information related to the nature or classification of a 
tumor, information related to prognosis and/or information usefiil in selecting an 
appropriate treatment. Selection of treatment may include the choice of a particular 

30 chemotherapeutic agent or other treatment modality such as surgery, radiation, etc., a 
choice about whether to withhold or deliver therapy, etc. 
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Differential expression: A gene exhibits differential expression at the RNA level if its 
RNA transcript vanes in abunuancc between uifLcrcnl samples in a sample set. A; 
gene exhibits differential expression at the protein level, if a polypeptide encoded by 
the gene varies in abundance between different samples in a sample set. In the 
5 context of a microarray experiment, differential expression generally refers to 
differential expression at the RNA level 

Gene: For the purposes of the present invention, the term "gene" has its meaning as 
understood in the art. However, it will be appreciated by ftiose of ordmaiy skill m the 

10 art that the term "gene" has a variety of meanmgs m tfie art, some of which mclude 
gene regulatoiy sequences (e.g., promoters, enhancers, etc.) and/or mtron sequences, 
3* untranslated regions, etc., and others of which are limited to coding sequences. It 
will flirther be appreciated that definitions of "gene" include references to nucleic 
acids that do not encode proteins but rather encode functional SNA molecules such as 

15 tiRNAs. For flie purpose of clarity we note that, as used in the present application, the 
term "gene" generally refers to a portion of a nucleic acid that encodes a protein; the 
term may optionally encompass regulatory sequences. This defioiition is not mtended 
to exclude application of the term "gene" to non-protem coding expression units but 
rather to clarify that, in most cases, the term as used in this document refers to a 

20 protein codmg nucleic acid. 

Gene product or expression product: A gene product or expression product is, in 
general, an RNA transcribed from the gene or a polypeptide encoded by an RNA 
transcribed from the gene. 

25 

Homology: The term "homology" refers to a degree of similarity betweai two or 
more nucleic acid sequences or between two or more amino acid sequences. As is 
well known m the art, given any nucleotide or amino acid sequence, homologous 
sequences may be identified by searching databases (e.g., GenBank, EST [e)q)ressed 
30 sequence tag]databases, GST [gene sequence tag] databases, GSS [genome survey 
sequence] databases, organism sequencing project databases) using computer 
programs such as BLASTN for nucleotide sequences and BLASTP, gapped BLAST, 
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and PSI-BLAST for amino acid sequences. These programs are described in Altschul, 
SF, et al., Basic local alignment search tool, J. Mol Biol, 215(3): 403-410, 1990, 
Altchul, SF and Gish, W, Methods in Emymology, and Altschul, SF, et al^ "Gapped 
BLAST and PSI-BLAST: a new generation of protein database search programs", 
5 Nucleic Acids Res. 25:3389-3402, 1 997. In addition to identifying homologous 
sequences, the programs mentioned above typically provide an indication of the 
degree of homology. Determining the degree of identity or homology that exists 
between two or more amino acid sequences or between two or more nucleotide 
sequences can also be conveniently performed using any of a variety of other 
10 algorithms and computer programs known in the art Discussion and sources of 
appropriate programs may be found, for example, in Baxevanis, A., and Ouellette, 
B.F.F., Bioinformatics : A Practical Gtdde to the Analysis of Genes and Proteins, 
Wiley, 1998; mi Misener, S. and Krawetz, S. (eds.), Bioinformatics Methods and 
Protocols (Methods in Molecular Biology, Vol. 1 32), Humana Press, 1 999. 

15 

Operably linked: The term "operably Imked" refers to a relationship between two 
nucleic acid sequences wherein the expression of one of the nucleic acid sequences is 
controlled by, regulated by, modulated by, etc. the other nucleic acid sequence. For . 
example, a promoter is operably linked with a coding sequence if the promoter 
20 controls transcription ofthe coding sequence. Preferably a nucleic acid sequence that 
is operably linked to a second nucleic acid sequence is covalently linked, either 
directly or indirectly, to such a sequence, although any effective three-dimensional 
association is acceptable. 

25 Prognostic information and predictive information: As used herein the terms 
prognostic information and predictive information are used interchangeably and 
somewhat informally to refer to any information that may be used to foretell any 
aspect ofthe course of a disease or condition either in the absence or presence of 
treatment. Such information may include, but is not limited to, the average life . 

30 expectancy of a patient, the likelihood that a patient will survive for a given amount of 
time (e.g., 6 months, 1 year, 5 years, etc.), the likelihood that a patient will be cured of 
a disease, the likelihood that a patient's disease will respond to a particular therapy 
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(wherein response may be defined in any of a variety of ways). Prognostic and 
predictive information are included within the broad category of diagnostic 
information. 

S Sample: As uised herein, a sample obtained from a subject may include, but is not 
limited to, any or all of the following: a cell or cells, a portion of tissue, blood, serum, 
ascites, urme, saliva, and other body fluids, searetions, or excretions. The term 
'"sample" also includes any material derived by processmg such a sample. Derived 
samples may include nucleic acids or proteins extracted from the sample or obtained 
10 by subjecting the sample to techniques such as amplification or reverse transcription 
ofmRNA, etc. 

Specific binding: As used herein, the term refers to an interaction between a target 
polypeptide (or, more generally, a target molecule) and a binding molecule such as an 

1 5 antibody, agonist, or antagonist The interaction is typically dependent upon the 
presence of a particular structural feature of the target polypeptide such as an 
antigenic determinant or epitope recognized by the bindmg molecule. For sample, if 
an antibody is specific for epitope A, the presence of a polypeptide containing epitope 
A or the presence of fi^e unlabeled A in a reaction containing both fi^e labeled A and 

20 the antibody tiiereto, will reduce the amount of labeled A that binds to the antibody. 
It is to be understood that specificity need not be absolute. For example, it is well 
known m the art that numerous antibodies cross-react with other epitopes m addition 
to those present in the target molecule. Such cross-reactivity may be acceptable 
dependmg upon the application for which the antibody is to be used. Thus the degree 

25 of specificity of an antibody will depend on the context in which it is being used. In 
general, an antibody exhibits specificity for a particular partner if it favors binding of 
that partner above binding of other potential partners. One of ordinary skill in the art 
will be able to select antibodies having a suflBcient degree of specificity to perform 
appropriately in any given application (e.g., for detection of a target molecule, for 

30 therapeutic purposes, etc). It is also to be understood that specificity may be 
evaluated in the context of additional factors such as the aflEnity of the binding 
molecule for the target polypeptide versus the aflBnity of the binding molecule for 



27 



wo 02/08282 



PCTAJSOl/23574 



Other targets, e.g., competitors. If a binding molecule exhibits a high affinity for a 
target molecule that it is desired to detect and low affinity for nontarget molecules, the 
antibody will likely be an acceptable reagent for immunodiagnostic purposes. Once 
the specificity of a binding molecule is established in one or more contexts, it may be 
5 employed in other, preferably similar, contexts without necessarily re-evaluating its 
specificity. 

Treating a tumor: As used herein, treating a tumor is taken to mean treating a subject, 
who has the tumor. 

10 

Tumor sample: The tenn ^tumor sample" as used herein is taken broadly to include 
cell or tissue samples removed fi*om a tumor, cells (or iheir progeny) derived from a 
tumor that may be. located elsewhere in the body (e.g., cells in the bloodstream or at a 
site of metastasis), or any material derived by processing such a sample. Derived 
1 5 tumor samples may include nucleic acids or proteins extracted fi'om the sample or 
obtained by subjecting the sample to techniques such as amplification or reverse 
transcription of mRNA, etc. 

Tumor subclass: A tumor subclass, also referred to herein as a tumor subset or tumor 
20 class, is the group of tumors that display one or more phenotypic or genotypic 
characteristics that distinguish members of the group from other tumors. 

Vector: A vector, as used herein, is a nucleic acid molecule that includes sequences 
sufGcient to direct in vivo or in vitro replication of the molecule. These may either be 

25 self-replication sequences or sequences sufiBcient to direct integration of the vector 
into another nucleic acid present m a cell (eitiier an endogenous nucleic acid or one 
introduced into the cell by experimental manipulation), so that tiie vector sequences 
are replicated during replication of this nucleic acid. Preferred vectors include a 
cloning site, at which foreign nucleic acid molecules may be introduced. Vectors may 

30 include control sequences that have the ability to direct in vivo or in vitro expression 
of nucleic acid sequences introduced into the vector. Such control sequences may 
include, for example, transcriptional control sequences (e.g., promoters, enhancers. 
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terminators, etc.), splicing control sequences, translational control sequences, etc. 
sequences introduced into die vector results in production of a fusion protein. 
5 L Overview 

The invention is based on the identification of polynucleotides (cDNAs) 
corresponding to human genes that are differentially ejqjressed in human breast tumor . 
samples, the polypeptides encoded by these polynucleotides, and antibodies raised 
against these polypeptides. The invention encompasses the use of these 
1 0 polynucleotides, polypeptides, and antibodies as well as compositions containing 
them, either smgly or in combination, in the prediction, diagnosis, treatment, or 
prevention of cancCT and in the provision of prognostic and predictive information 
related to cancer. 

Nucleic acids encodmg BSTP-Ras/Rerg were identified based on e>q>ression 
15 profiles gafliered in a series of cDNA microarray experiments. As described m more 
detail in Examples 1, 2, and 4, cDNA microarrays each representing the same set of 
approximately 8100 different human genes were produced. The human cDNA clones 
used to produce the microarrays contained approxunately 4000 named genes, 2000 
genes with homology to named genes m other species, and approximately 2000 ESTs 
20 of unknown function. An mRNA sample was obtained fi'om each of a set of 84 tissue 
samples or cell lines. The e^qiression levels of the approximately 8100 genes were 
measured m each mRNA sample by hybridization to an mdividual microarray, 
yielding an e3q}ression profile for each gene across the experimental samples. These 
expression profiles were studied and compared and were used to identify nucleic acids 
25 encodmg BSTP-Ras«lerg. Althougji more details will be found m tfie Examples, a 
description of cDNA microarray technology and a description of the e3q)erimental 
approach employed in the identification of flie polynucleotides that encode BSTP- 
Ras/Rerg is presented here so that the invention may be better understood. Certain 
aspects of the mvention are then described m detail. 

30 

n. cDNA Microarray Technology 
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cDNA microarrays consist of multiple (usually thousands) of dijBferent cDNAs 
spotted (usually using a robotic spotting device) onto known locations on a solid 
support, such as a glass microscope slide. After spotting, the cDNAs are usually 
cross-linked to the support, e.g., by UV irradiation. The cDNAs are typically obtained 
5 by PCR amplification of plasmid library inserts using primers complementary to the 
vector backbone portion of the plasmid or to the gene itself for genes where sequence 
is known. PCR products suitable for production of microarrays are typically between 
O.S and 2.S kB in length. Full length cDNAs, expressed sequence tags (ESTs), or 
randomly chosen cDNAs from any library of interest can be chosen. ESTs are 
10 partially sequenced cDNAs as described, for example, in L. Hillier, et al., Generation 
and analysis of 

280,000 human expressed sequence tags. Genome Research^ 6, 807-828, 1996. The 
afore-mentioned article is h^in incorporated by reference, as are the entire teachings 
of all other patents and journal articles mentioned herein, for all purposes and not just 
IS those related to the particular context in which they are mentioned. Although some 
ESTs correspond to known genes, frequentiy very littie or no information regarding 
any particular EST is available except for a small amount of 3' and/or S* sequence and, 
possibly, the tissue of origin of the mRNA from 

which the EST was derived. As will be appreciated by one of ordinaiy skill in the art, 
20 in general the cDNAs contain sufficient sequence information to uniquely identify a 
gene within the human genome. Furthermore, in general the cDNAs are of sufficient 
length to hybridize specifically to cDNA obtained from mKNA derived from a single 
gene under the hybridization conditions of the e^eriment. 

In a typical microarray experiment, a microanay is hybridized with 
25 differentially labeled RNA or DNA populations derived fix)m two diflferent samples. 
Most commonly RNA (either total RNA or poly A* RNA is isolated from cells or 
tissues of interest and is reverse transcribed to yield cDNA. Labelmg is usually 
performed during reverse transcription by incorporating a labeled nucleotide in the 
reaction mixture. Although various labels can be used, most commonly the 
30 nucleotide is conjugated with the fluorescent dyes Cy3 or Cy5. For example, Cy5- 
dUTP and Cy3-dUTP can be used. cDNA derived from one sample (representing, for 
example, a particular cell type, tissue type or growth condition) is labeled with one 
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fluor while cDNA derived from a second sample (representing, for example, a 
different cell type, tissue type, or growui condition) is labeled with the second fiuor. 
Similar amounts of labeled material from the two samples are cohybridized to the 
microarray. In the case of a microarray experiment in which the samples are labeled * 
5 with Cy5 (which fluoresces red) and Cy3 (which fluoresces green), the prfanary data 
(obtained by scannmg the microarray using a detector capable of quantitatively 
detecting fluorescence intensity) are ratios of fluorescence intensity (red/green, R/G). : 
These ratios represent the relative concentrations of cDNA molecules that hybridized 
to tihie cDNAs 

10 represented on the microarray and thus reflect the relative expression levels of the 
mRNA coirespondmg to each cDNA/gene represented on the microarray. 

Each microarray experiment can provide tens of thousands of data points, each 
representing the relative expression of a particular gene in the two samples. 
Appropriate organization and analysis of the data is of key unportance. Various 

1 5 computer programs have been developed to fecilitate data analysis. One basis for 
organizing gene expression data is to group genes with similar ejqjression patterns 
together mto clusters. A method for performing hi^archical cluster analysis and 
display of data derived from microarray e>q)eriments is described in Eisen, M., 
Spelhnan, P., Brown, P., and Botstein, D., Cluster analysis and display of genome- 

20 wide expression pattems, Proc. Natl. Acad. Sci. USA, 95: 14863-14868, 1998. As 
described therein, clustering can be combined with a graphical representation of the 
primary data m which each data point is represented with a color that quantitatively 
and qualitatively represents that data point By converting the data fit)m a large table 
of numbers into a visual format, tiiis process facilitates an intuitive analysis of the 

25 data. Additional information and details regardmg the maHi^atical tools and/or fhe 
clustering approach itself may be found, for example, in Sokal, R.R. & Sneath, 
PJH.A. Principles of numerical taxonomy, xvi, 359, W. H. Freeman, San 
Francisco,1963; Hartigan, J A. Clustering algorithms, xiii, 351, Wiley, New York, 
1975; Paull, K.D. et al. Display and analysis of pattems of differential activity of 

30 drugs against human tumor cell lines: development of mean graph and COMPARE 
algorithm. J Natl Cancer Inst 81, 1088-92,1989; Weinstein, J.N. et al. Neural 
computing in cancer drug development: 
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predicting mechanism of action. Science 258, 447-51, 1992; van Osdol, W.W., 
Myers, T.G., Paull, KD., Kohn, K.W. & Weinstein, J.N. Use of the Kohonen self- 
organizing map to study the mechanisms of action of chemotherapeutic agents. J Natl 
Cancer Inst 86, 1853-9, 1994; and Weinstein, J.N. etal An information-intensive 
5 approach to the molecular pharmacology of cancer. Science^ 275, 343-9, 1997. 

Further details of the experimental methods used in the present invention are 
found in the Examples. Additional information describing methods for fabricating 
and using microarrays is found in U.S. Patent No. 5,807,522, which is herein 
incorporated by reference. Instructions for constructing microarray hardware (e.g., 

10 arrayers and scanners) using commercially available parts can be found at 

http://cmgm.stanford.edu/pbrown/ and m Cheung, V., Morley, M., Aguilar, F., 
Massimi, A., Kucherlapati, R., and Childs, G., "Making and readmg microarrays" 
Nature Genetics Supplement^ 21:15-19, 1999. Additional discussions of microarray 
technology and protocols for preparing samples and performing microrarray 

15 experiments are found in, for example, DNA arrays for analysis of gene expression, 
Methods Enzymol, 303 : 1 79-205, 1 999; Fluorescence-based expression monitoring 
using microarrays, Methods Enzyntol, 306: 3-18, 1999; and M. Schena (ed.), DNA 
Microarrays: A Practical Approach, Oxford University PresSj Oxford, UK, 1999. 
Descriptions of how to use an arrayer and the associated software are found at 

20 http://cmgm.stanford.edu/pbrown/mguide/anayerHlMiyA^ which is 

herein incorporated by reference. 

in. Experimental Approach of the Invention 

The present invention encompasses the realization that genes that are 

25 di£ferentially expressed in tumors are of use in tunior classification and are targets for 
the development of diagnostic and therapeutic agents. Differentially expressed genes 
are likely to be responsible for the different phenotypic characteristics of tumors. The 
present invention identifies one such gene. In general, a differentially expressed gene 
is a gene whose transcript abundance varies between different tumor samples. For 

30 example, and without intendmg to be limiting, the transcript level of a differentially 
expressed gene may vary by at least fourfold trom its average abundance in a given 
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sample set in at least 1 sample, at least two samples, at least three samples, etc. Of 
course other criteria for difierentiai expression may be employed. 

While analysis of multiple genes is of use in developing a robust classification 
of tumors, each of the differentially expressed genes and Hieir encoded proteins is a 
S target for the development of diagnostic and therapeutic agents. Investigation of 
variation in individual genes in breast tumors reveals that molecular variation can be 
related to important features of clinical variation. For example, expression of the 
estrogen receptor alpha gene (ESRl), the Erb-B2/HER2/neu oncogene (the target for 
the monoclonal antibody Herceptin* (Trastuzumab), a recently approved treatment for 

10 certain patients with metastatic breast cancer), and flie mutational status at the TP53, 
BRCAl and BRCA2 loci have shown that molecular variation can be related to 
important features of clmical variation. (Discussed, for ©cample, in Osbome, C.K., et 
a/.. The value of estrogen and progesterone receptors in Ae treatment of breast cancer. 
Cancer 46, 2884-2888, 1980; Ingvarsson, S., Molecular genetics of breast cancer 

15 progression. Seminars in Cancer Biology, 9, 277-288, 1999; Breast Cancer Lmkage 
Consortium, Pathology of familial breast cancer: differences between breast cancers in 
carriers of BRCAl and BRCAl mutations and sporadic cases, Lancet, 349, 1505-1510, 
1997; Anderson, T. I., et al.. Prognostic significance of TP53 alterations in breast 
carcinoma. Br J Cancer, 68, 540-548, 1993 and references cited in these articles). In 

20 particular, approximately 60% to 70% of breast tumors express the estrogen receptor, 
and this expression has been shown to be a fevomble prognostic factor (reviewed in 
AUred, D.C., et al. Prognostic and Predictive Factors in Breast Cancer by 
Inmiunohistochemical Analysis, Afo&/7iP<atfAotog);, 11(2), 155-168, 1998). As these 
examples demonstrate, the study of genetic variation m breast cancer has the potential 

25 to contribute to improved classification, diagnosis, and therapy for patients. 

As described m more detail m Examples 1, 2, and 4, in order to identify genes 
that are differentially expressed in breast tumors, cDNA microarrays each 
representing the same set of approximately 8100 different human genes were 
produced. The human cDNA clones used to produce the microarrays contained 

30 approximately 4000 named genes, 2000 genes with homology to named genes in other 
species, and approximately 2000 ESTs of unknown function. An mRNA sample was 
obtained fi-om each of a set of 84 tissue samples or cell lines. The expression levels of 
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the approximately 8100 genes were measured in each mRNA sample by hybridization 
to an individual microarray, yielding an expression profile for each gene across the 
experunental samples. Although more details will be found in the Examples, an 
overview of the experimental procedure is presented here so that the invention may be 
S better understood. 

Variation in patterns of gene expression were characterized in 62 breast tumor 
samples from 40 different patients, 3 normal breast tissue samples, and 19 samples 
from 17 cultured human cell lines (one of which was sampled 3 times under different 
conditions). Twenty of the tumors had been sampled twice, before and after a 16 week, 

1 0 course of doxorubicm chemotherapy, and two tumors were paired with a lymph node 
metastasis from the same patient The other 18 tumor samples were single samples 
from individual tumors. A detailed listing of the tumor samples and various 
characteristics including clinical estrogen receptor and Erb-B2 status as assessed using 
antibody staining, estrogen receptor and £rb-B2 status as assessed by microarray 

IS result, tumor grade, differentiation, survival status and time, age at diagnosis, 
doxorubicin response, and p53 status is presented in Table 5 of the gene subset 
application. A listing of the cell lines including description and ATCC (American 
Tissue Culture Collection) number or reference is presented in Table 3 of the gene 
subset application. The cell lines provided a frameworic for interpreting the variation 

20 in gene expression patterns seen in the tumor samples and mcluded gene expression 
models for many of the cell types encountered in tumors. 

As described m more detail in Example 2, mRNA was isolated from each 
sample. cDNA labeled with the fluorescent dye CyS was prepared from each 
experimental sample separately. Fluorescently labeled cDNA, labeled using a second 

25 distinguishable dye (Cy3), was prepared fix)m a pool of mRNAs isolated firom 1 1 
different cultured cell lines. The pooled mRNA sample served as a reference to 
provide a common internal standard against which each gene*s expression in each 
experimental sample was measured. 

Comparative e}q)ression measurements were made by separately mixing CyS- 

30 labeled experimental cDNA derived fit>m each of the 84 samples with a portion of Ihe 
Cy3-labeled reference cDNA, and hybridizing each mixture to an individual cDNA 
microarray. The ratio of CyS fluorescence to Cy3 fluorescence measured at each 
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cDNA element on the microarray was then quantitatively measured. The use of a 
common iefefence standard in each hybridization allowed the fluorescence ratios to be 
treated as comparative measurements of the expression level of each gene across all 
the experimental samples. 
5 A hierarchical clustering method (Eisen, et aL, 1998) was used to group genes 

based on similarity in the pattern with which their expression varied over all 
ejqjerimental samples. The same clustering method was used to group the 
experimental samples (tissue and cell lines separately) based on the similarity in their 
patterns of expression. Interpretation of the data obt^ned from the clustering 
1 0 algorithm was fecilitated by displaying the data in die form of tumor and gene 
dendrograms. In the tumor dendrograms, the pattem and length of the branches 
reflects the relatedness of the tumor samples with respect to thefr expression of genes 
represented on the microarray. Microarray unages may be found m Perou, et al. 
Nature, 2000, and on inventors' Web site (http://genome- 
15 www.stanfonLedu/molecularportraits/). 

Hie similarity of the gene expression profiles of individual tumor samples or 
groups of tumor samples to one another is inversely related to the length of the 
branches that connect them. Thus, for example, adjacent tumor samples connected to 
one another by short vertical branches descendmg from a common horizontal branch 
20 (e.g., tumor samples Norway 48-BE and Norway 48-AF close to the rigjit of the tumor 
dendrogram) are more closely related to one another in terms of their gene expression 
profiles than adjacent tumor samples connected to one another by longer vertical 
branches descending from a common horizontal branch (e.g., tumor samples Norway 
lOO-BE and Norway lOO-AF at the left side of the tumor dendrogram). To die extent 
25 that the gene e3q)ression programs dictate the biological properties and behavior of the 
tumors and reflect their physiological state and environment, it is expected that the 
clustering of the tumors reflects phenotypic relationships among them, e.g., tumor 
samples connected by short horizontal branches (i.e., located in close proximity to one 
another) are expected to exhibit shnilar phenotypic features. In the gene 
30 dendrograms, the pattem and length of the branches reflects the relatedness of the 
genes with respect to their expression profiles across the tumor samples. Similarly to 
the tumor samples, genes connected by short vertical branches are more similar to one 
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another in teims of expression profile than genes connected by longer vertical 
branches. 

The expression patterns of the genes were also displayed using a matrix 
format, with each row representing all of the hybridization results for a single cDNA 
S element on the array and each column representing the measured expression levels for 
all genes in a single sample. In this format, tumor samples widi similar patterns of 
expression across the gene set are close to each other along the horizontal dimension. 
Similarly, genes with similar expression patterns across the set of samples are close to 
each other along the v^ical dunension. To allow the patterns of expression to be 

1 0 visualized, the normalized expression value of each gene was represented by a colored 
box, using red to represent expression levels greater than the median and green to 
represent expression levels less than the median. In all microarray images, the 
brightest red color represents transcript levels at least 16-fold greater than the median, 
and the brightest green color represents transcript levels at least 16-foid below the 

IS median* This display format facilitates comparisons between genes and the 

recognition of significant patterns. Certain gene subsets of particular mterest are 
indicated by colored bars along the right side of the matrices. These subsets are 
discussed further in the gene subs^ application. 

20 IV. Identification of BSTP-Ras/Rerg and Studies Thereof 
A. Identification of BSTP-Ras/Rerg 

LMA.G.E. clone S03S81 was identified based on the expression pattern of its 
correspondmg mRNA among the 84 samples analyzed by microarray hybridization. 
In particular, transcripts corresponding to clone 503581 varied in abundance by at 

25 least 4-foId fi'om their median abundance in the sample set, among at least 3 of the 84 
samples. Thus the polynucleotide corresponding to clone 503581 was differentially 
expressed among the tumor samples, indicating its potential utility for classifying 
tumors. Numerical data indicating the measured expression of mRNA corresponding 
to clone 503581 (i.e., mRNA hybridizing to clone 503581) appear in Table 1. In 

30 addition, the expression profile and additional data related to clone 503581 are 
available at Applicants' Web site (http;//genome- 

www.stanford.edu/moIecularportraits/). As shown therein, the expression level of 
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mRNA corresponding to clone 503581 varied significantly among tumor samples. 

expressed in tumor samples Norway 27-BE, Norway 27-AF, and Stanford 24. The 
relative expression level was also high in tumors Norway 100-BE, Norway 100-AF, 
5 Norway 1 11, and Norway 19, among others. However, the relative e7q)ression level 
of mRNA corresponding to clone 503581 was particularly low in tumor sample 
Norway 14-BE and was also low in tumor samples Norway 109-BE, Norway 109-AF, 
Norway 1 01 -BE, Stanford 2-LN, among others. As is evident fix>m the discussion 
above, the presence of mRNA corresponding to clone 503581 reflects the expression 

10 of the gene from which the mRNA is transcribed. 

The expression profile of mRNA corresponding to clone 50358 1 was 
particularly noteworthy m that it closely resembled that of the estrogen receptor 1 
(ESRI) gene. Previous studies have demonstrated genes that have similar patterns of 
expression when measured across many samples of varying biology are significantly 

1 5 enriched for those genes that share roles fai common biologic processes (DeRisi, et al, 
Iyer, et aL, Ross, et al. Brown, et a/., Spelbnan, et al, Eisen, et al.). For example, 
Ross, et dl. demonstrated that the pattern of gene expression measured across sixty 
unrelated cell lines identified a set of genes that varied in a manner correlated with the 
measured doubling time of the cell lines. The correlated set of genes was highly 

20 enriched for those genes known to play roles in transiting the cell cycle. However, the 
fine patterns of gene expression within the larger set that varied in a manner correlated . 
with the variation in measured cell doubling time (approximately 1500 genes) 
distinguished genes involved in RNA splicing fit>m those involved m protein 
translation. The subtle regulation of gene expression that differed between these sets 

25 of genes involved in sunilar but aspects of cell physiology and allowed accurate 

functional categorization of known genes and likely those genes of unknown fimction 
that share expression patterns. While not wishing to be bound by any theory, 
Applicants suggest that the measurement of gene expression across many breast tumor 
samples such as that described above is likely to identify genes diat act in common 

30 physiologic processes that vary amongst the tumor samples. The co-variation of clone 
503581 wifii the estrogen receptor gene suggests that these two genes (and potentially 
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other genes that show similar patterns of expression in the microarray experiments 
described herein) might have common roles in breast physiology. 

The similarity in the expression profiles of clone 503581 and ESRI is evident 
in the microarray images and in Table 1 . For example, the relative expression level of 
5 ESRI was high in tumor samples Norway 27-BE, Norway 27-AF, Norway 100-BE 
and Norway 100-AF, among others, as was the expression of mKNA corresponding to 
clone 503581, and the relative expression level of ESRI was low in tumor samples 
Norway 14-BE, Norway 48-AF, and Stanford 2-LN. The similarity in the expression 
profiles of the gene corresponding to clone 503581 and ESRJ is evidenced by the fact 

1 0 that the correlation coefficient (a well known measure of statistical similarity) 
between the dataset representing expression of the gene corresponding to clone 
503581 and the dataset representing e}q>ression of ESRI is greater than the correlation 
coefficient between tiie dataset representing expression of the gene corresponding to 
clone 503581 and the dataset representing e?q)ression of any other gene among the 

15 genes represented on the microarray. In addition, the similarity in the expression 
profiles of the gene corresponding to clone 50358 1 and ESRI is confirmed by the 
results of application of the hierarchical clustering algorithm to the entire set of tumor 
samples analyzed. While not wishing to be bound by any theory, these results suggest 
that clone 503581 may share a role in biologic processes related to tiiose in whidi the 

20 estrogen receptor is involved. 

B. Cloning and Sequence Analvsis of BST-Ras/Rerg 

A search of Genbank revealed that only a small portion at the 3' end and a 
small portion at the 5' end of clone 503581 had been sequenced. Therefore, clone 

25 503581 was sequenced and the fiill lengdi sequence (SEQ ID NO: 3) was obtained. 
An overlapping clone (IMA.G.E. clone 28777) was identified by searching Genbank 
for clones homologous to clone 503581. A consensus nucleotide sequence, SEQ ID 
NO: 4 was derived based on sequencing and analysis of overlapping 1MA.GK clone 
28777. In SEQ ID NO: 3 and SEQ ID NO: 4, the abbreviation N stands for any 

30 nucleotide. The consensus sequence (SEQ ID NO: 4) was used as input for a search 
of Genbank with the BLASTX program (which translates a nucleotide sequence m 
each of the possible six reading fi^es and then searches for homologous amino acid 
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sequences). The search indicated the presence of a region encoding an amino acid 
sequence with homology to the Ras famiiy of smaii guanine nucleotide-binding (GTF- 
binding) proteins. A corresponding open reading frame (SEQ ID NO: 2) was 
identified within the consensus sequence. The predicted amino acid sequence for the 
polypeptide encoded by this open reading frame is presented as SEQ ID NO: 1 . A 
full length cDNA encoding the polypeptide of SEQ ID N0:1, i.e., BSTP-Ras/Reig, 
was assembled from the IMj\.GJB. clones listed above and publicly available ESTs. 
The sequence of the fiill length cDNA is presented in Figure 2 as SEQ ID N0:5. 

Thus in one embodiment; the invention encompasses a polypeptide 
comprising the ammo acid sequence of SEQ ID NO: 1 . The polypeptide is 199 amino 
acids in length. As shown in Figure 3, a search of the GenBank database usmg the 
BLASTP computer program (Altschul, Stephen F., Thomas L. Madden, Alejandro A. 
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search 
programs". Nucleic Acids Res. 25:3389-3402) performed with fliis sequ^ce indicated 
that fliis polypeptide displays a high level of homology wifli members of flie Ras 
family, consistent witii flie infomation obtamed from the BLASTX seardi described 
above. 

As discussed below, expression of the gene encoding the polypeptide of 
SEQ ID N0:1 is regulated by estrogen, and the polypeptide exhibits growth-mhibitory 
eflfects. For these reasons the gene that encodes the polypeptide of SEQ ID NO: 1 is 
also known as Rerg (Ras-related and estrogen-regulated growth-inhibitory gene), and 
the polypeptide is known as Rerg. To reflect these names, tiie polypeptide comprising 
the ammo acid sequence presented in SEQ ID N0:1 will be referred to herein 
mterchangeably as BSTP-Ras, BSTP-Ras/Rerg, or Rerg, and the gene encoding the 
polypeptide will be referred to herein interchangeably as BST-RAS, BST-RAS/Rerg, or 
Rerg. The BST-RAS/Rerg gene is located on chromosome 12 within two overlapping 
BAG clones (AC007543 and AA022334) and is in close proxfanity to the K-Ras gene. 
The gene is spread out over approximately 110 kB of genomic DNA and is contained 
in four exons. The first exon of at least 170 bp is separated from the second 57 base 
pair exon by an intron of over 95 kB. While the cloned cDNA sequence starts at 
position -109 relative to the putative initiator methionine, the predicted transcription 
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start site as detennined from analysis of the genomic sequence suggests that the actual 
start site may be at position -780. Analysis of the BST-RAS/Rerg gene sequence 
identified two consensus ER binding sites within the 5' upstream region. 

GTP-binding proteins play a key role in cell cycle control and also in a number 
5 of other physiological and regulatory processes including cell signaling, growth, • 
dififerentiation, protein synthesis, and cytoskeletal dynamics, and cell movement. For 
example, Ras proteins play a major role in transducing extracellular signals to tiie 
nucleus to stunulate cell proliferation or differentiation. The Ras proteins belong to 
the large Ras superfamily of monomeric GTPases, which also contains three other 

10 subfamilies: (1) the Rho and Rac proteins, which transduce signals from receptors on 
the cell surface to the actin cytoskeleton; (2) the Rab &mily, involved in regulating 
the movement of intracellular transport vesicles; and (3) the Ran family, involved m 
the regulation of cell cycle progression, nuclear envelope structure and function, and 
nucleocytoplasmic transport 

IS Despite sequence differences between subfamily members, Ras-related 

GTPases contain five highly conserved domains (Gl-GS) and function as guanine- 
nucleotide-dependent molecular switches. Recently sev^ novel Ras-related 
GTPases have been identified that exhibit amino acid sequence features that 
distinguish them firom Ras proteins (reviewed in Reuther, G. W., and Der, C. J., Curr 

20 Opin Cell Biol 12(2), 1 57-65, 2000). These include ARHJy an imprinted tumor 

suppressor gene in breast and ovarian carcinomas; expression of ARHl in cancer cells 
inhibited cell growth and was associated with down-regulation of cyclin Dl promoter 
activity and induction of p2r^''^' (Yu, Y., Xu, F., Peng, H., Fang, X., Zhao, S., Li, 
v., Cuevas, B., Kuo, W. L., Gray, J. W., Siciliano, M., Mills, G. B., and Bast, R. C, 

25 Jr., Proc Natl Acad Sci USA 96(1), 214-9, 1999). 

Like other monomeric GTP-binding proteins and G protems, Ras functions as 
a molecular switch, cycling between two confomriational states: a GTP-bound active 
form and an inactive GDP-bound form (Boume, H. R., Sanders, D. A., and 
McCormick, F. (1991) Naiure 349(6305), 1 17-27). Transitions between tiiese states 

30 are regulated by two classes of protems. GTPase-activating proteins (GAPs) 

inactivate Ras by mcreasing the rate of hydrolysis of bound GTP by Ras. Guanme 
nucleotide releasmg proteins (GNRPs) stimulate the exchange of bound GDP for 
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GTP, thus leading to activation of Ras. GAPs and GNRPs are regulated, either 
directly or indirectly, through interactions with receptor and nonreceptor tyiosine 
kinases. GTP-bound Ras is active and capable of interacting (e.g.,via a Ras elTector 
domain) with downstream effectors such as the kinases Raf and phosphatidyl inositol 
(3) kinase, which leads to a signal transduction cascade involving proteins of the 
mitogen-activated protem (MAP) kinase femily (among others) that ultimately 
modulates the activity of transcription factors in tiie nucleus. Additional mformation 
on Ras and other members of the Ras superfemily may be found, for example, in 
Rommel, C. and Hafen, C, Ras - a versatile cellular switch, Cvrr. Op, Genet & Dev., 
8: 412-418, 1998 and references therein, and in Hesketh, Robin, The Oncogene and 
Tumor Suppressor Gene Factsbook (Factsbook Series), edition. Academic Press, 
San Diego, 1997). 

Mutations in genes of the Ras fimily play a significant role in a variety of 
cancer types including cancers of the lung, pancreas, colon, and myeloid systems and 
various other organs. In general, mutations that convert a normal Ras gene into an 
oncogenic (cancer-causing) form result in activation of the protein, e.g., by locking it 
in the GTP-bound state. An extensive discussion of Ras and the role of mutated Ras 
genes in cancer may be found in Franz Watzinger, Thomas Lion, **RAS family". Atlas 
Genet Cytogenet Oncol Haematol, March 1999, which is reproduced at 
http://www.infobiogen.fr/services/chromcancer/Deep/ras.htmI. Activating Ras 
mutations can be found in human malignancies with an overall frequency of 15-20%. 
The three human Ras &mily members that have been most widely associated with 
cancer are known as H-Ras, K-Ras, and N-Ras. A high incidence of Ras gene 
mutations has been reported in malignant tumors of the pancreas (80-90%, K-ras) 
(Ahnoguera C et al. Most human carcinomas of the exocrine pancreas contain mutant 
c-K^ras genes. Cell 53: 549, 1988; Smit VT et al. K-RAS codon 12 mutations occur 
very fi«qu«itly in pancreatic adenocarcinomas. Nucleic Acids Res, 16: 7773, 1988), in 
colorectal carcinomas (30-60%, K-ras) (Breivik J et al K-ras mutation in colorectal 
cancer relations to patient age, sex and tumour location. Br J Cancer 1994; 69: 367, 
1994; Spandidos DA et al. Ras oncogene activation in benign and malignant 
colorectal tumours. Tumori,, 81(3 Suppl):7, 1995), in lung (25-60%, K-ras) 
(Husga^el-Pursiainen K et al. Detection of ras gene mutations in human lung cancer: 
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comparison of two screening assays based on the polymerase chain reaction. Environ 
Health Perspect 98: 183-5: 183, 1992; Rodenhuis S, Slebos RJ. Clinical significance 
of ras oncogene activation in human lung cancer. Cancer Res, 52: 2665s, 1992; 
Sagawa M et al. K-ras point mutation occurs in the early stage of carcinogenesis in 
5 lung cancer. Br J Cancer IT. 720, 1998) in hematopoietic neoplasia of myeloid origin 
(18-30%, K-and N-ras) (Nakagawa T et al. Multiple point mutation of N-ras and K-ras 
oncog^es in myelodysplastic syndrome and acute myelogenous leukemia. Oncology 
1992; 49: 1 14, 1992; Neubauer A et al. Prognostic importance of mutations in the ras 
proto-oncogenes in de novo acute myeloid leukemia. Bloody 83: 1603, 1994). Ras 

10 mutation or overexpression has also been implicated in breast cancer (Miyakis, S. et 
al.. Differential expression and mutation of the ras &mily genes in human breast 
cancer, Biochem Biophys Res Commun, 251(2):609, 1998; Rochlitz CF et al. 
Incidence of activating ras oncogene mutations associated with primaiy and metastatic 
human breast cancer. Cancer Res 49: 357, 1989; Spandidos DA. Oncogene activation 

1 5 in malignant transformation: a study of H-ras in human breast cancer. Anticancer Res; 
7: 99, 1987). It is noted that the term "activity" in reference to Ras family members 
may refer to either the biological activity or activities of the protein, e.g., its effects on 
some aspect of the phenotype of a cell or organism, or to particular biochemical 
activities of binding GTP or hydrolyzing GTP. Thus refer^ce to an mcrease or 

20 decrease in Ras "activity" can be ambiguous unless it is understood which activity is 
being referred to, which can usually be ascertained from the context In general, 
unl^s otherwise stated, a reference to Ras "activity" herein refers to the effect of the 
Ras protein on cell or organism phenotype rather than to a particular biochemical 
activity. For example, the increased activity of ^activated' H-Ras proteins refers to an 

25 increased ability to enhance cell proliferation and/or tumorogenicity. Such activation 
is mediated by Ras mutants that have reduced GTPase activity. Conversely, increased 
activity of BSTP-Ras/Rerg refers to increased ability to inhibit cell proliferation 
and/or tumorigenicity. Such activation may be mediated by BSTP-Ras/Rerg mutants 
or variants that have reduced GTPase activity. The growth inhibitory activity 

30 demonstrated in Figure 8 is consistent with the biologic activity of other Ras family 
members thadiave been implicated directly in second messanger pathways that 
regulate cell growth. However, the specific pathways modulated by BSTP-Ras/Rerg 
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remain to be defined and may include activities whose secondary effect is modulation 

-.^'^-It,.! . ^l. 

Ul UCIIUIOI ^lUWlli. 

Figure 6 presents an alignment of BSTP-Ras/Rerg with several other Ras 
family members. The alignment was performed with the CLUSTAL W1.6 program 
5 (Thompson, J. D., Higgins, D, G., and Gibson, T. J., Nucleic Acids Res 22(22), 4673- 
80, 1994). Hyphens represent gaps introduced for optimal alignment. Numbers are 
residue numbers. Amino acid residues that were conserved in at least four of the five 
proteins in the alignment are placed in shaded boxes and amino acid similarity is 
indicated by Ime boxes. Consensus sequences for GTP-bindmg regions (Gl-GS) 

10 (heavy overline) and the position of conserved CAAX motif (dashed underline) are 
labeled and highlighted. Similar alignments with other Ras &mUy members can be 
readily obtained, thus providing additional information about which sites in BSTP- 
Ras/Rerg correspond to well-studied sites in o&er Ras proteins. 

Hie predicted amino acid sequence of the Rerg protein is 50% and 47% 

15 identical to the Dictyostelium Ras-D and Ras-S proteins respectively, and 44% to 45% 
identical to human Rit, TC21/R-Ras2, M-Ras/R-Ras3, Rin, and Raplb. Although 
Rerg shares significant sequence identity and organization with Ras, it also contams a 
number of unique features. First, Rerg has an alanine at the position corresponding to 
the glycine at position 12 of Ras. Substitution of this residue with alanine results m 

20 the constitutive activation of Ras (Barbacid, M. (1987) Armu Rev Biochem 56, 779- 
827). Another unique feature of Rerg is that it lacks any known COOH-terminal 
prenylation motif suggesting that it is not subject to post-translational prpnylation 
(Casey, P. J., and Seabra, M. C. (1996) J Biol Chem 271(10), 5289-92; Glomset, J. A., 
and Famsworth, C. C. (\99A) Annu Rev Cell Biol 10, 181-205). Prenylation facilitates 

25 the association of Ras with the plasma membrane and nonprenylated mutants of Ras 
are rendered completely inactive. Instead, stretches of basic residues are found in the 
Rerg COOH-tenninal region. These unique COOH-terminal fisatures of Rerg are 
similar to those of Rit and Rin (Wes, P. D., Yu, M., and Montell, C, Embo J 15(21), 
5839-48, 1996; Lee, C. H., Delia, N. G., Chew, C. E., and Zack, D. J., JNeurosci 

30 16(21), 6784-94, 1996). 

C. Biological Characterization of BSTP-Ras/Rerg 
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While not wishing to be bound by any theory, as discussed in Examples 12 
and 13, the BST-RAS/Rerg gene and its encoded polypeptide display various 
characteristics that suggest a role for BSTP-Ras/Rerg in inhibiting cell growth and/or 
tumor development or growth. As demonstrated in Example 12, both stable and 
5 transient over-expression of BSTP-Ras/Rerg in MCF-7 cells inhibited their 

proliferation. Based on the results of two widely used assays for oncogenicity (colony 
growth in soft agar and tumor formation in nude mice), cells ovmxpressing BSTP- 
Ras/Rerg have significantly reduced tumorigenicity. MCF-7 cells stably 
overexpressihg BSTP-Ras/Rerg showed greatly reduced colony forming efiFiciency 

10 (SO-80% reduction) in soft agar as compared to control MCF-7 cells not 

overexpresslng BSTP-Ras/Rerg. Furtheimore, as demonstrated in Example 13, MCF- 
7 cells stably overexpresslng BSTP-Ras/Rerg at high levels showed decreased 
tumorigenicity m nude mice as compared to control MCF-7 cells in that nude mice 
mjected with MCF-7 cells overexpressmg BSTP-Ras/Rerg at high levels remained 

1 S tumor free for significantly longer than mice injected with control MCF-7 cells. 

In addition to the biological data in cell lines and animals, the e3q)ression of 
BST-RAS/Rerg across the tumor samples was compared wifli the e^qpression pattern of 
the ""proliferation" cluster of g^es described in more detail in copending U.S. Patent 
Application ""Reagents and Methods for Use in Managing Breast Cancer, filed July 

20 26, 2001 and in U.S. Provisional Patent Application Sen No. 60>^20,967, filed July 
26, 2000. As described therein, the ""proliferation" cluster is a group of approximately 
120 genes whose level of expression correlates with cellular proliferation rates (See 
Perou, C, et a/., 1999; Ross, D., et cd.. Nature Genetics, 24(3): 227-35, 2000.). 
Expression of this subset of genes varied widely among the tumor samples, and was 

25 generally well correlated with a standard pathological index of tumor cell 

proliferation, namely the mitotic mdex. Tumors that most highly express BST- 
RAS/Rerg tend to be die least proliferative, and the ones with low to no BST-RAS/Rerg 
expression tend to be the most proliferative. BST-RAS/Rerg expression was decreased 
or lost in a significant percentage of tumors that show a poor clinical prognosis. BST- 

30 RAS/Rerg is a member of the "luminal epithelial/ER+" gene cluster, which contains 
most of the genes whose high expression was associated with a good prognosis (i.e., 
long survival times). The luminal gene subset includes a number of genes known to 
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be expressed in luminal epithelial cells of the nonnal mammary gland (e.g., the ER 
and the prolactin receptor) and is characterized in that certam of the genes in the 
subset are overexpressed at the RNA level in samples obtaihed 6om a subset of 
tumors that had a good prognosis relative to the mtire group of tumors (the luminal 
5 group is further described m copendmg U.S. Patent Application ""Reagents and 
Methods for Use in Managing Breast Cancer, filed July 26, 2001 and in U.S. 
Provisional Patent Application Ser. No. 60/220,967, filed July 26, 2000). The luminal 
gene subset is identified with a purple bar along the side of the color matrbc m part b 
and is shown in expanded form in part c. The clones that identify the genes in the 

10 luminal subset are listed by their Genbank accession numbers in Table 8. 

Thus a number of lines of evidence indicate that BSTP-Ras/Rerg has growth- 
mhibitory effects. The invention therefore provides methods for inhibiting growth of 
a cell comprising increasmg the activity of BSTP-Ras/Rerg. In obtain embodiments 
of the mvention the methods comprise increasing the GTP-binding activity and/or 

15 decreasing the GTPase activity of BSTP-Ras/Rerg. Increasing tiie GTP-binding 
activity and/or decreasing the GTPase activity of BSTP-Ras/Rerg can be 
accomplished in any of a variety of ways. For example, these effects may be achieved 
by making certain mutations in tiie BSTP-Ras/Rerg coding sequaice. Such mutations 
may be made based on the homology of BSTP-Ras/Rerg witii oflier Ras fimily 

20 members. The mutations may be made and tested in vitro, e.g., in an expression 
construct, which may th^ be introduced into the cell. Hie effects can also be 
achieved by contactmg BSTP-Ras/Rerg witii a compound fliat activates or inhibits 
BSTP-Ras/Rei:g level or activity. Methods fi)r identifying such compounds are 
described above. 

25 Methods for early detection of cancer by screening for mutations in Ras genes 

are currently being explored (K-ras mutation: early detection in molecular diagnosis . 
and risk assessment of colorectal, pancreas, and lung cancers-a review, Cancer 
Detect Prev. 24(1):1-12, 2000). Ras proteins are also targets for the development of 
new therapeutic agents for cancer Taveras, AG, et al., BioorgMed Chem Ras 

30 oncoprotein inhibitors: the discovery of potent, Ras nucleotide exchange inhibitors 
and the structural determination of a drug-protem complex, 5(l):125-33, 1997. In 
addition, as mentioned above, the gene expression profile of the gene encoding BSTP- 
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Ras/Rerg (i.e., BST-RAS/Rerg) closely parallels the expression profile of the estrogen 
receptor gene ESRL As is well known in the art, tumors that express the estrogen 
receptor (ER) have a better prognosis than tumors that do not express ER (reviewed in 
Alhred, D.C., et al). In addition, measurement of estrogen receptor levels in breast 
5 tumor samples is used clinically for prognostic and therapeutic decisions. Since the 
expression profile of the BST-RAS/Rerg gene closely resembles that of the estrogen 
receptor gene, determining the e^ipression of BST-RAS/Rerg may be used to provide 
prognostic information for breast cancer patients. Determming the expression of BST- 
RAS/Rerg may include.measuring mRNA transcribed firom BST-RAS/Rerg^ e.g., 

10 mRNA comprising a nucleotide sequence set forth in SEQ ID N0:2, 3, 4, SEQ ID 
NO:S, or portions or variants diereof Such measurements can be made using a 
variety of techniques including, but not limited to. Northern analysis and microarray 
hybridization. Determining the expression of BST-RAS/Rerg may include detecting 
(qualitatively or quantitatively) a translation product of BST-RAS/Rerg, e.g., a 

IS polypeptide comprising an amino acid sequence set forth in SEQ ID NO:l or a 

polypeptide having significant similarity to the polypeptide of SEQ ID NO:L A wide 
varied of mettiods for detecting and measuring protein levels are known m the art. 
For example, the detecting can be performed using immunohistochemistiy or other 
antibody-based approaches including Western blots, use of antigen-antibody protein 

20 arrays, etc. 

The discovery and characterization of BSTP-Ras/Rerg and the discovery of the 
differential expressidn of the gene encoding BSTP-Ras/Rerg satisfy a need in die art 
by providing compositions useful in the diagnosis, treatment, and prevention of cancer 
and by providing methods useful in the classification of cancer, the provision of 

25 prognostic information to patients with cancer, and the identification of new 

compounds for inhibiting cell growth and for the treatment of cancer. BST-RAS/Rerg 
shares a pattern of expression similar to the estrogen receptor gene, a gene known io 
play a role in growdi control of normal and tumor breast cells, and already targeted for 
mhibition m both prevention and treatment of breast cancer. Overexpression of 

30 BSTP-Ras/Rerg inhibits cell growth in vitro and tumor formation in an anunal model. 
These results suggest that BST-RAS/Rerg may be a useful gene to target for 
therapeutic intervention in subsets of breast cancer and a useful gene to distinguish 
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between different subsets of breast cancer. 
V. Further Aspects of tfie Invention 

A. Polynucleotides, polypeptides, antibodies, vectors, and host cells 
5 The invention also encompasses polypeptides possessing significant similarity 

to BSTP-Ras/Rerg, i.e., polypeptides whose sequence possesses signficant similarity 
to the sequence of SEQ ID NO: 1 . In certahi embodunents of the invention a 
significantly similar polypeptide is encoded by a human gene. Definitions of 
"significantly similar^' make reference to the BLAST algoriflun and BLOSUM 
10 substitution matrix, which are described m Altschul, SF, et al., "Gqjped BLAST and 
PSI-BLAST: a new generation of protein database search programs". Nucleic Acids 
Res. 25:3389-3402, 1997 and Henikofl^ S. and Henikofl^ J., "Ammo acid substitution 
matrices fi-om protein blocks*', Proc. Natl Acad Set 89, 10915-10919, 1992. 
In certain embodiments of the invention a polypeptide is considered 

15 significantiy sunilar if, when the ammo acid sequence of the polypeptide is compared 
wifli tiie ammo acid sequence of the polypeptide of SEQ ID N0:1 usmg tiie BLAST 
algorithm and the BLOSUM substitution matrix with de&ult parameters^ the result is 
a % identity greater than 60 or a % positive greater tiian 70 encompassing at least 25% 
of the length of SEQ ID NO: 1, or botii. In other embodiments of the invention a 

20 polypeptide is considered significantiy similar i1^ when the amino acid sequence of the 
polypeptide is compared with the amino acid sequence of the polypeptide of SEQ ID 
N0:1 using the BLAST algorithm and the BLOSUM substitution matrfac witii default 
parameters, the result is a % identity greater than 60 or a % positive greater than 70 
encompassmg at least 50% of the length of SEQ ID NO: 1, or both. In otiier 

25 embodiments of the invention a polypeptide is considered significantly similar if, 
when the amino acid sequence of the polypeptide is compared with the 
. sequence of die polypeptide of SEQ ID N0:1 using the BLAST algorithm and the 
BLOSUM substitution matrix with default parameters, the result is a % identity 
greater than 60 or a % positive greater than 70 encompassing at least 75% of the 

30 length of SEQ ID NO: 1, or both. In other embodiments of the invention a polypeptide 
is considered significantly similar if, when the amino acid sequence of the polypeptide 
is compared with the amino acid sequence of the polypeptide of SEQ ID NO:l using 
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the BLAST algorithm and the BLOSUM substitution matrix with default parameters, 
the result is a % identity greater than 60 or a % positive greater than 70 encompassing 
at least 90% of the length of SEQ ID N0:1, or both. In other embodiments of the 
invention a polypeptide is considered significantly similar if, when the amino acid 
S sequence of the polypeptide is compared with the amino acid sequence of the 
polypeptide of SEQ ID N0:1 ustag the BLAST algorithm and the BLOSUM 
substitution matrix with de&ult parameters, die result is a % identity greater than 60 
or a % positive greater than 70 encompassing at least 95% of the length of SEQ ID 
NO:l,orboth. 

10 In other embodiments of the invention a polypeptide is considered 

significantly similar if, when the amino acid sequence of the polypeptide is compared 
with the amino acid sequence of the polypeptide of SEQ ID NO: 1 using the BLAST 
algorithm and the BLOSUM substitution matrix with de&ult parameters, the result is 
a % identity greater than 70 or a % positive greater than 80 encompassing at least 25% 

15 of the length of SEQ IDNOrl, or both. In other embodiments of tiie invention a 

polypeptide is considered significantly similar i^ when the amino acid sequence of the 
polypeptide is compared with the amino acid sequence of the polypeptide of SEQ ID 
NO:l usmg the BLAST algorithm and the BLOSUM substitution matrix with de&ult 
parameters, the result is a % identity greater than 70 or a % positive greater than 80 

20 encompassing at least 75% of the length of SEQ ID NO: 1, or both, fa other 

embodiments of the invention a polypeptide is considered significantly similar i^ 
wh^ the amino acid sequence of the polypeptide is compared with the amino acid 
sequence of the polypeptide of SEQ ID NO: 1 usmg the BLAST algorithm and ttie 
BLOSUM substitution matrix with default parameters, the result is a % identity 

25 greater than 70 or a % positive greater than 80 encompassing at least 90% of tfie 

length of SEQ ID N0:1, or both. In other embodiments of the invention a polypeptide 
is considered significantly similar if, when the amino acid sequence of the polypeptide 
is compared with the amino acid sequence of the polypeptide of SEQ ID NO: 1 using 
the BLAST algorithm and the BLOSUM substitution matrix with default parameters, 

30 the result is a % identity greater than 70 or a % positive greater than 80 encompassing 
at least 95% of the length of SEQ ID NO:l, or both. 
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In other embodiments of the invention a polypeptide is considered 
significantiy similar if, when the amino acid sequence of the polypeptide is compared 
with the amino acid sequence of the polypeptide of SEQ ID N0:1 using the BLAST 
algorithm and the BLOSUM substitution matrix with defeult parameters, the result is 
5 a % identity greater than 80 or a % positive greater tfian 90 encompassing at least 25% 
of the length of SEQ ED N0:1, or both. In other embodiments of tiie invention a 
polypeptide is considered significantly similar if, when the amino acid sequence of the 
polypeptide is compared with the amino acid sequence of the polypeptide of SEQ ED 
N0:1 usmg the BLAST algorithm and the BLOSUM substitution matrix with defeult 
1 0 parameters, the result is a % identity greats than 80 or a % positive greater than 90 
encompassing at least 75% of the length of SEQ ID NO:l, or both. In other 
embodiments of the invention a polypeptide is considered significantly similar if, 
when the amino acid sequence of the polypeptide is compared with the amino acid 
sequence of the polypeptide of SEQ ID NO:l using flie BLAST algorithm and the 
15 BLOSUM substitution matrfac with defeult parameters, the result is a % identity 
greater than 80 or a % positive greater than 90 encompassmg at least 90% of the 
length of SEQ ID NO: 1, or both. In other embodiments of the invention a polypeptide 
is considered significantly similar if, when the ammo acid sequence of the polypeptide 
is compared with the ammo acid sequ^ce of flie polypeptide of SEQ ID N0:1 using 
20 the BLAST algorithm and the BLOSUM substitution matrix with defeult parameters, 
the result is a % identity greater than 80 or a % positive greater than 90 encompassing 
at least 95% of the length of SEQ ID N0:1, or both. 

In other embodiments of the invention a polypeptide is considered 
significantiy similar i^ when the ammo acid sequence of the polypeptide is compared 
25 with the amino acid sequence of flie polypeptide of SEQ ID N0:1 using the BLAST 
algorithm and the BLOSUM substitution matrix with default parameters, the result is 
a % identity greater than 90 or a % positive greater than 95 encompassmg at least 25% 
of the length of SEQ ID NO: 1, or both. In oflier embodiments of the invention a 
polypeptide is considered significantly similar if, when the amino acid sequence of the 
30 polypeptide is compared with the amino acid sequence of the polypeptide of SEQ ID 
N0:1 using the BLAST algorithm and the BLOSUM substitution matrix with defeult 
parameters, the result is a % identity greater than 90 or a % positive greater than 95 
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encompassing at least 75% of the length of SEQ ID NO:l, or both. In other 
embodunents of the invention a polypeptide is considered significantly similar ifc 
when the amino acid sequence of the polypeptide is compared with the amino acid 
sequence of the polypeptide of SEQ ID NO: 1 using tiie BLAST algorithm and the 
5 BLOSUM substitution matrix with default parameters, the result is a % identity 
greater than 90 or a % positive greater than 95 encompassing at least 90% of the 
length of SEQ ID NO: 1, or both. In other embodiments of the invention a polypeptide 
is considered significantly similar if, when the amino acid sequence of the polypeptide 
is compared with the amino acid sequence of the polypeptide of SEQ ID NO: 1 using 

1 0 the BLAST algorithm and the BLOSUM substitution matrix with de&ult parameters, 
the result is a % identity greater than 90 or a % positive greater than 95 encompassing 
at least 95% of the length of SEQ ID NO: 1 , or both. By "encompassing at least X% 
of the length of SEQ ID NO: 1 " (or, in general, SEQ ID NO:Y) is meant that tiie 
length of the portion of SEQ ID NO: 1 that is being compared with a potentially 

1 5 sunilar protein is at least X% of the length of SEQ ID NO: 1 (or, in general, SEQ ID 
NO:Y). 

In certain embodiments of the invention a polypeptide having significant 
sunilarity to the polypeptide of SEQ ID NO:l includes one or more conservative 
amino acid substitutions. Examples of conservative substitutions are well known in 

20 the art. See, for example. Biochemistry, 4th Ed., Stryer, L., et aL, W. Freeman and 
Co., 1995 and U.S. Patent No. 6,015,692. The uivention also encompasses variants of 
the polypeptide of SEQ ID NO:l and variants of significantiy similar polypeptide, 
wherein the variants have one or more altered or modified amino acids. Alterations 
and modifications may include the replacement of an L- amino acid with a D-amino 

25 acid, or various modifications including, but not limited to, phosphorylation, 
carboxylation, alkylation, etc. 

Certain polypeptides having significant similarity to the polypeptide of SEQ 
ID NO:l contain at least one functional or structural characteristic of BSTP-Ras/Rerg. 
For example, certain of the polypeptides contain an epitope that binds an antibody 

30 that binds to BSTP-Ras^erg, Certain of the polypeptides have amino acid sequences 
that differ by less than 20, less than 10, or less than 5 amino acids from the amino acid 
sequence of SEQ ED NO: 1. Certain or the polypeptides contain a GTP-binding 
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domain and/or a Ras effector domain. Certain of the polypeptides retain at least one 
biologiual activity, structural feature, or immunological activity of BSTP-Ras/Rerg. . 

The invention also includes jfegments of a polypeptide comprising the 
sequence of SEQ ID NO: 1 and fragments of polypeptides having significant similarity 
to the polypeptide of SEQ ID NO:l. In certain embodiments of the invention the 
fragments are between S and 15 amino acids in length, inclusive. In certain 
embodiments of the invention the fragments are between 10 and 15 amino acids in 
length, inclusive. In certam embodiments of the invention the fi-agments are between 
10 and 20 amino acids in length, mclusive. In certain embodiments of the invention 
the fragments are between 20 and 30 amino acids in length, inclusive. Such fragments 
are usefiil, for example, as antigens for the generation of antibodies. In other 
embodiments of the invention the fragments include a characteristic feature of a Ras 
femily member such as a GTP-binding domain or a Ras effector domain. In certam 
embodiments of the invention the lengtii of the fragments is at least 50%, at least 
75%, at least 90%, or at least 95% of the frill lengtii of BSTP-Ras/Rerg. 

The invention also includes polynucleotides that encode BSTP-Ras/Rerg, i.e., 
polynucleotides that encode a polypeptide comprising the sequence of SEQ ID N0:1. 
In one embodiment the invention encompasses a polynucleotide comprising the 
polynucleotide sequence of SEQ ID NO:2. In other embodiments tiie invention 
encompasses a polynucleotide comprising the polynucleotide sequence of SEQ ID 
NO: 3, SEQ ID NO: 4, or SEQ ID N0:5. The mvention fiirfhcr mcludes 
polynucleotides that encode the polypeptides having significant similarity to the 
polypeptide of SEQ ID N0:1 described above. In addition, the invention includes 
fragments of the foregoing polynucleotides, wherein the fragments are at least twelve 
nucleotides in length, at least 15 nucleotides in length, at least 20 nucleotides in 
lengtii, or at least 25 nucleotides in length. In certain embodimente of the invention 
the length of a fragment is at least 50%, at least 60%, at least 60%, at least 80%, at 
least 90%, at least 95%, or at least 99% of tiie fiiU lengtii of tiie corresponding 
polynucleotide. 

As is well known in the art, due to tiie degeneracy of the genetic code (i.e., the 
fact that m many cases multiple different codons can code for the same amino acid), 
multiple different polynucleotide sequences encode BSTP-Ras/Rerg of the present 
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invention. The invention encompasses all of the sequences that can be made by 
substituting alternative codons in accordance with the genetic code. It is noted that 
such substitution must be made appropriately in view of the reading frame of the 
polynucleotide. It is fiirther noted that many of these polynucleotide sequences will 
5 display little or no homology with tiie sequences in SEQ ID NO: 2, SEQ ID NO: 3, 
SEQ ID NO: 4or SEQ ID NO: S. In certain embodiments of the invention it may be 
preferred to employ polynucleotides encoding BSTP-Ras/Rerg having a significantly 
different codon usage to that found in naturally occurring BSTP-Ras/Rerg. For 
example, if the inventive polynucleotides are to be used to express BSTP-Ras/Rerg m 

10 ' a heterologous system such as a bacterial or yeast expression system, it may be 
desirable to employ polynucleotides having a codon usage preferred for optimal 
expression in the heterologous system. Such codon usage preferences are well known 
in the art. Altering the nucleotide sequence encodmg BSTP-Ras/Reegg may have 
additional uses such as maximizing RNA stability, as it is well known in the art tiiat 

IS RNA stability can be affected by die sequence of the RNA. 

The invention also includes polynucleotides havmg a complementary 
nucleotide sequence to any of the inventive polynucleotides described above. Such 
complementary polynucleotides are useful as probes, e.g., to detect e^ipression of the 
inventive polynucleotides at the RNA level. Such complementary polynucleotides are 

20 also useful as antisense reagents, to inhibit the expression of the corresponding genes 
at the protein level, e.g., by mterfermg with mRNA translation. Inhibiting gene 
esqiression has a variety of applications, e.g., it may be used to gain information about 
the fimction of the ^coded protein. In addition, antisense inhibition of gene 
expression may be used therapeutically. 

25 The invention encompasses polynucleotides that are able to hybridi2e to any of 

the inventive polynucleotide sequences discussed above under various conditions of 
stringency. In general, a hybridizmg polynucleotide will have a sequence either 
partially or fully complementary with die polynucleotide to which it hybridizes. 
Hybridization conditions of various stringency are well known in the art and are, in 

30 general, governed by the concentration of reagents such as salts and formamide in the 
hybridization buffer as well as by the temperature at which hybridization is 
performed. For example, a stringent hybridization can be performed by use of a 
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hybridization buffer comprising 30% formamide in 0.9M saline/0.09M sodium citrate 
(SSC) buffer at a temperature of 45'' C followed by washing twice with that SSC 
buffer at 45** C. A moderately stringent hybridization condition could include use of a 
hybridization buffer comprising 20% formamide in 0.8M salme/0.08M SSC buffer at 
a temperature of 37° C. followed by washing once with that SSC buffer at 2T C. The 
selection of appropriate hybridization conditions will ^ically be determined by the 
purpose for which hybridization is to be carried out and is a matter of choice for the 
practitioner. When the polynucleotide is present on an array, the hybridization 
conditions may be chosen as described in Example 4. 

The invention further includes oligonucleotides comprising a fragment of a 
polynucleotide encoding BSTP-Ras/TRerg or comprising a fragment of a 
polynucleotide complementary to such a polynucleotide. In certain embodiments of 
the invention the oligonucleotides are between 6 nucleotides and 60 nucleotides in 
length. In certain embodiments of the invention the oligonucleotides are between 
about IS to 30 nucleotides m length, while in certain other embodiments of the 
invention the oligonucleotides are between about 20 and 25 nucleotides in length. 
Such oligonucleotides are useful, for example, as primers m PCR amplification or in 
hybridization assays includmg microarray assays. 

The invention contemplates the production of any of the 
polynucleotides,fiBgments, and variants thereof described herein by chemical 
synthesis, by PCR, or by use of expression vectors includmg tiie polynucleotides. 
Such expression vectors and methods of their use are well known in the art. The 
inventive polynucleotides and fragments thereof can be produced using an in vitro 
transcription system or withm a host cell contaming a vector comprising the mventive 
polynucleotide sequence and appropriate genetic control elements (e.g., enhancers, 
promoters, termmators) operatively linked to the polynucleotide sequence so as to 
direct transcription therefrom. In vitro transcription systems are well known in the art 
as are vectors containmg appropriate genetic control elements for directing 
transcription of an inserted polynucleotide sequence and host cells in which such 
vectors are maintained. The invention encompasses the production of either DNA or 
RNA havmg tiie sequence of an mventive polynucleotide. One of ordinaiy skill in 
the art will be able to select appropriate vectors and synthesis conditions depending 
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upon whether it is desired to produce DNA or RNA. It is noted that the inventive 
polynucleotides and fragments thereof can also be synthesized entirely through 
chemical means. Techniques and machines for chemical synthesis of polynucleotides 
are well known m the art. The polynucleotides can be labeled or conjugated with 
S detectable moieties including radionuclides, enzymes, chromogenic substrates, 
fluorescent substances, etc., using any of a variety of techniques. 

Polynucleotides encoding BSTP-Ras/Rerg can be extended, e.g., to identify 
upstream elements such as promoters or other regulatory elements, using techniques 
that are well known in the art. Such techniques are described, for example, in U.S. 

10 Patent No. 6,008, 337, which is herein incorporated by reference, and include a 
variety of PCR-based methods, screening of cDNA libraries, primer extension, etc. 
Genomic sequence such as introns can also be obtained. Discovery of additional 
sequence may also be performed using computer-based searches of sequenced human 
DNA. As is well known, large portions of the himian genome have been sequenced, 

1 S but relatively little information exists as to the structure and organization of much of 
the sequence. Thus extension of the inventive polynucleotide sequences may be 
performed by careful examination of previously sequenced genomic DNA. Preferably 
any predictions based on examination of genomic sequence are verified 
experimentally, since it is well known in the art that a significant number of errors 

20 exist m the genomic sequence and m the predictions (e.g., predictions of genes and 
open readmg frames) based thereon. 

It is well known in the art that different cell types, cells at different stages of 
differentiation, and/or cells within organisms at different developmental stages may 
express variants of the same gene, e.g., variants derived by alternative splicing. 

25 Tho'efore, multiple different mRNAs corresponding to the same gene may exist. 

Such mRNAs are transcribed from the same region of genomic DNA but may vary in 
sequence, usually lacking or containing domains corresponding to introns or regions 
at the S' and/or 3* end of the message. The present invention encompasses such 
variant polynucleotides and their encoded polypeptides. Variant polynucleotides can 

30 be identified and cloned by screening cDNA libraries produced using mRNA fix>m 
cells of various different types, differentiation states, or fix)m tissues at dififerent 
developmental stages. 
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The invention contemplates production of any of the BSTP-Ras/Rerg 
polypeptides or fragments llicreof using any of a variety of techniques including both 
in vivo or in vitro synthesis. For example, polynucleotides encoding an mventive 
polypeptide can be inserted into an expression vector, which can then be introduced 
5 into an appropriate host cell (e.g., a bacterial, yeast, insect, or mammalian cell). Thus 
the invention includes an expression vector comprising a polynucleotide comprismg a 
nucleotide sequence set forth m SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, or 
SEQ ID N0:5. The invention further includes an expression vector comprising a 
polynucleotide that encodes the polypeptide of SEQ ID NO: 1 or encodes a 

10 significantly similar polypeptide. The invention further mcludes a host cell 
comprising any of the afore-mentioned vectors. A wide variety of vector/host 
expression systems are known in the art. In general, vectors contain necessary control 
and regulatory sequences (e.g., enhancers, promotore, polyadenylation sequence, etc.) 
operatively linked to an inserted polynucleotide so as to dkect expression of the 

1 S polypeptide m the appropriate host cell. Dependmg upon the host cell to be 
employed, appropriate vectors may mclude phages, vmises, or plasmids. The 
inv^tion encompasses any available vector/host expression system and specifically 
includes vectors that direct expression of an mventive polynucleotide, vectors that 
direct expression of an inventive polypeptide (e.g., e3q)ression vectors), in addition to 

20 cells and cell lines transformed with such vectors. In certain embodhnents of the 
mvention the inventive polypeptide is secreted Srom a cell transformed whh an 
expression vector, thereby allowmg purification from the medium rather ttian by 
harvesting the cells. The mvention also encompasses the production of mventive 
polypeptides in cells that have been engineered to express such polypeptides 

25 according to the methods described m U.S. Patent No. 6,063,630, which discloses 
meOiods of 'turning on" an endogenous gene in cells that normally express the gene at 
low or undetectable levels. Methods 

for harvesting, isolating, purifying, etc., polypeptides fi^m cells expressing such 
polypeptides are well known in the art. 
30 BSTP-Ras/Rerg and polypeptides that are significantly similar to BSTP- 

Ras/Rerg can also be produced in animals or plants that are transgenic for a 
polynucleotide sequence encoding the polypeptide. The invention includes such 
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animals and plants. In addition to their potential use as sources for the inventive 
polypeptides, transgenic animals may be used to study the fiinction of the inventive 
polypeptides. Methods for the production of transgenic animals and plants, as well as 
methods for purifying and harvesting inventive polypeptides from such animals and 
S plants are well known in the art and are within the scope of &e invention. The 

invention also encompasses cells and transgenic anunals that have been engineered to 
lack expression of the inventive polynucleotides. Methods for "knocking ouf a gene 
using the technique of homologous recombination and methods of creating cells and 
organisms lacking expression of the knocked out gene are well known in the art and 

10 are described, for example, in U.S. Patent No. 5,464,764, U.S. Patent No. 5,487,992, 
U.S. Patent No. 5,627,059, and U.S. Patent No. 5,631,153. 

As will be appreciated by one of ordinary skill in the art, in certain 
circumstances it may be advantageous to modify an inventive polynucleotide 
sequence by ligating it to a heterologous sequence, thereby enabling the production of 

15 a fusion protem. Certain vectors are designed to incorporate such heterologous 
sequences so that insertion of a polynucleotide into the vector at an appropriate 
location results in an in-frame fusion to the heterologous sequence, which may be 
either upstream or downstream fix>m the inserted polynucleotide. Such heterologous 
sequences may encode tags or cleavable Imk^ sequences such as glutatiiione S- 

20 transferase (GST), the hemaglutinm epitope known as HA tag, a short stretch of the 
Myc protem (Myc tag), FLAG tag, 6X His tag, maltose bindmg protem tag, etc. In 
general, many of these tags are useful for the purification and/or detection of the 
polypeptide usmg an antibody or other reagent that binds to the tag. Other usefiil 
heterologous sequences mclude that of green fluorescent protein (GFP), which allows 

25 visualization of the fusion protein. The present invention encompasses all such 
BSTP-Ras/Rerg fiision proteins. 

The presence of a consensus sequence for GTP-binding regions in BSTP- 
Ras/Rerg indicates that the protein binds GTP and strongly suggests that its growth- 
inhibitory fimctions are mediated by the GTP-bound form. A decrease m the GTPase 

30 activity of BSTP-Ras/Rerg may therefore be expected to have similar growth- 

mhibitory effects to those mduced by overexpressing BSTP-Ras/Rerg. The invention 
provides fragments and variants (e.g., mutants) of BSTP-Ras/Rerg that have increased 
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activity. Certain of the firagments or variants have mcreased GTP-binding activity 

v/i xAwvAVMowu \j±x ao\^ ax^iiyiiy, mc iiivciiLiuii aib'o pruviues xragmenis or variants 
of BSTP-Ras/Rerg that have decreased activity. Certain of the fragments or variants 
have decreased GTP-bindmg activity and/or increased GTPase activity. 
5 Activating point mutations have been localized m codons 12, 13, 59, 61, 63, 

116, 117, 1 19, and 146 of the H-Ras protein (See, e.g., Der, C. J., Finkel, T., and 
Cooper, G. M, Cell 44(1), 167-76, 1986; Barbacid M., Eur J Clin Invest, 20: 225, 
1990). All of these alterations occur at or near the guanine nucleotide binding sites. 
The effects of these point mutations are either reduced GTPase activity (if amino acids 
10 12, 13, 59, 61, 63 are involved), so that oncogenic RAS mutants are locked in the 
active GTP-bound state, or decreased nucleotide affinity, and hence, increased 
exchange of bound GDP for cytosolic GTP (if amino acids 1 16, 1 17, 1 19 or 146 are 
affected). The inefiBcient deactivation of the active GTP-bound RAS protefas is 
intensified by the inability of GAPs to stimulate the conversion to the inactive, GDP- 
1 5 bound state. These point mutations cause an accumulation of activated RAS-GTP 
complexes, leading to continuous signal transduction by facilitatmg accumulation of 
constitutively active, GTP-bound RAS protein. 

The alignment of BSTP-Ras/Rerg wtth other Ras family members provides a 
basis for rational selection of mutations that are likely to increase or decrease BSTP- 
20 Ras/Rerg activity. However, the mutations are not limited to point mutations identical 
to those identified in other Ras femily members but may include different single 
ammo acid substitutions. Multiple ammo acid substitutions may also be made. In 
certain embodiments of the invention less than five amino acid substitutions are made. 
In certain other embodiments of the invention less than ten amino acid substitutions . 
25 arc made. In certain other embodhnentsofthe invention less than 20 amino acid 
substitutions are made. Activating mutations may also include deletions and/or 
additions. In certain embodiments of the invention the deletion or addition is a single 
amino acid In other embodiments of the invention the deletion or addition is less 
than five amino acids m length. In certain other embodiments of the invention the 
30 deletion or addition is less than ten amino acids m length. In certain other 

embodhnents of the invention the deletion or addition is less than 20 ammo acids in 
length. Combinations of the foregomg mutations are also mcluded in the invention. 
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While the range of possible mutations is broad, since the number of amino 
acids is limited it is possible to list all mutations that meet the various criteria outlined 
above. In addition, the range of mutations is limited in that the effects of mutations 
on the GTP-binding and/or GTPase activity of the mutated polypeptide may readily 
5 be tested according to standard methods that are well known in the art For sample, 
the time dependence and/or concentration dependence of guanine nucleotide binding 
to BSTP-Ras/Rerg or a fragment or variant thereoi^ or to a significantly similar 
polypeptide, the nucleotide specificity, and/or the guanine nucleotide dissociation rate 
may be determined by incubating the polypeptide with £q)propriate radiolabeled 

10 nucleotide (e.g., p^S]GTPyS or pH]GDP) and performing an appropriate filtration 
assay with detection of label. GTPase activity may be measured by incubation with 
appropriate radiolabeled nucleotide (e.g., [a-^^]GTP and analyzing reaction products 
using thin layer chromatogr^hy. 

The invention provides an antibody that bmds to BSTP-Ras/Rerg or to a 

IS polypeptide havmg significant sunilarity to BSTP-Ras/Rerg. Antibodies to these 
polypeptides may be generated, for example, as described m Example 6 below. In 
general, such antibodies may be generated by methods well known in the art and 
described, for example, in Harlow, E., and Lane, D., Antibodies: A Laboratory 
Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 1988. Details . 

20 and references for the production of antibodies based on an inventive polypeptide may 
also be found in U.S. Patent No. 6,008,337. Antibodies may include, but aro not . 
limited to, polyclonal, monoclonal, chimeric (e.g., 'liumanized"), and single chain 
antibodies, and Fab fragments. The invention encompasses ^Uy human" antibodies 
produced using the XenoMouse™ technology (AbGenix Corp., Fremont, CA) 

25 according to the techniques described in U.S. Patent No. 6,075,181. 

B. Diagnostics and methods of use thereof 

The invention provides reagents and methods for detecting the 
inventive polynucleotides and polypeptides described above. The reagents are useful 
30 for diagnostic purposes among others. In one aspect, the mvention provides a method 
of classifying tumors by detecting the presence of one or more of the mventive 
polypeptides or polynucleotides, e.g., polypeptides or polynucleotides comprising a 
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sequence set forth in SEQ ID N0:1, SEQ ID NO: 2, SEQ ID N0:3, SEQ ID NO: 4, or 
oij-v ^ iN^-/. J, ui puiypcpuucs iiaving signiiiuant simuanty to ine polypeptide ot iiliQ 
ID NO: 1 . As is well known in the art, a polypeptide may be detected using a variety 
of techniques that employ an antibody that binds to the polypeptide. In addition, in 
5 certain embodiments of the invention the polypeptides are detected using other 
modalities known in the art for the detection of polypeptides, such as aptamers 
(Aptamers, Molecular Diagnosis, Vol. 4, No. 4, 1999), reagents derived from 
combinatorial libraries for specific detection of protems in complex mixtures, random 
peptide afSnity reagents, etc. In general, any appropriate method for detecting a 
10 polypeptide may be used in conjunction with the present invention, allhough 
antibodies may represent a particularly appropriate modality. 

Thus the invention provides a method for classifymg a disease comprising the 
steps o£ (a) obtaining cells or tissue from a site of disease; (b) detectmg a polypeptide 
having a sequence selected from the group consisting of SEQ ID N0:1, or a 
15 p61ypeptide having significant similarity to tiie polypeptide of SEQ ID NOrl; and (c) 
placing the disease into one of a set of predetermined categories based on detection of 
the polypeptide. The predetermined categories are, in general, characterized by 
differences in their expression of BSTP-Ras/Rerg and also by differences in some 
aspect of tumor phenotype. For example, and without intending to be limitmg, one 
20 predetermined category may be characterized by a good prognosis; one predetermmed 
category may be characterized by a poor prognosis; and one predetermined category 
may be characterized by a non-responder phenotype. In addition to differences in 
phenotype, the predetermined categories also feature differences in fee expression of 
BSTP-Ras, thus allowing the placement of a tumor into one of the categories based on 
25 tiie detection (which may include measurement) of BSTP-Ras/Rerg in a sample from 
the tumor. Thus tiie assignment may be used as a basis for providing diagnostic, 
prognostic, and/or predictive information to a patient havmg the tumor. The 
assignment may be used to stratify a patient for a clinical trial, to select a particular 
treatment, etc. 

30 The invention also provides additional methods for classifying a disease. One 

such method comprises tiie steps of: (a) obtaining a sample containing cells or tissue 
from a subject; 
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(b) detecting the presence of a polynucleotide that encodes a polypeptide having the 
sequence of SEQ ID NO: 1 or a polypeptide having significant similarity to the 
polypeptide of SEQ ID NO: 1 within the sample; and (c) assigning the disease to one 
of a set of predetermined categories based on detection of the polynucleotide. Any 
S appropriate technique can be used to perform the detection step including, but not 
limited to, the various hybridization-based assays described herem. It is to be 
understood that nucleic acids within the cell or tissue sample may be subjected to 
various procedures such as purification, reverse transcription, aniplification, etc., prior 
to the actual detection. Thus the detection itself may be performed on a sample 

10 derived fi'om the cell or tissue sample through such a procedure and may involve 

detection of a complementary polynucleotide. Detection within such a derived sample 
indicates the presence of the polynucleotide witfam the cell or tissue sample. 
Similarly, the methods that involve detecting the presence of a polypeptide within a 
cell or tissue sample may involve detection of the polypeptide within a sample derived 

1 S fit)m the cell or tissue sample. 

The mvention encompasses a number of uses for antibodies that bind to 
BSTP-Ras/Rerg. In a broad aspect, antibodies that bind to BSTP-Ras/Rerg can be 
used to provide mformation useful for diagnosis, prognosis, classification, or 
monitoring of a disorder characterized by the inappropriate expression of the 

20 polypeptide. TTie term "inappropriate expression", as used herein, can include, but is 
not lunited to, underexpression, overexpression, or e3q)ression in a cell type or organ 
in which the polypeptide is not normally expressed. Inappropriate expression can 
include (1) underexpression relative to normal for that cell or tissue, (2) 
overexpression relative to normal for that cell or tissue, or (3) mislocalization witfam a 

25 cell or tissue relative to normal for that cell or tissue. In order to provide a basis for 
diagnosis, prognosis, classification, or monitoring of a disorder characterized by the 
mappropriate expression of the BSTP-Ras/Rerg polypeptide, a nonnal or standard 
expression profile can first be established by measuring the expression of BSTP- 
Ras/Rerg m cells, tissues, body fluids, etc., obtained fi-om subjects not suflfering fi'om 

30 the disorder. In general, a range of values may be considered normal, and departure 
fi'om withm this range of values may be taken to indicate that an individual suffers 
fi'om or is at increased likelihood to develop the disorder. Of course in certain cases 



60 



wo 02/08282 



PCTAJSOl/23574 



the information will simply consist of an indication of the presence or absence of 
BSTP-Ras/Kerg in a cell or tissue sample (e.g., a sample of breast cancer cells or 
tissue), regardless of whether BSTP-Ras/Rerg expression is considered 
"inappropriate". 

5 As used herein the term "diagnostic information" includes, but is not limited 

to, any type of information Aat is useful in determinmg whether a patient has, or is at 
increased risk for developmg, a disease or disorder; for providing a prognosis for a 
patient having a disease or disorder; for classifying a disease or disorder; for 
monitoring a patient for recurrence of a disease or disorder; for selecting a preferred 
10 tfierapy; for predicting the likelihood ofresponse to a therapy, etc. Incertam 

embodiments of the invention, antibodies to BSTP-Ras/Rerg are used for providing 
diagnostic information for cancer, particularly for breast cancer. In general, 
diagnostic assays in which the antibodies may be employed mclude methods that use 
the antibody to detect BSTP-Ras/Rerg in a tissue sample, cell sample, body fluid 
15 sample (e.g., serum), cell extract, etc. Thus the invention provides a method for 

detecting a polypeptide comprising an ammo acid sequence set forth in SEQ ID NO:l 
or a polypeptide having significant similarity to the polypeptide of SEQ ID N0:1 in a 
biological sample comprising steps of: (a) contacting die biological sample with an 
antibody that binds to the polypeptide of SEQ ID N0:1; and (b) determming whether , 
20 the antibody specifically binds to the sample, the binding being an mdication that the 
sample contains the polypeptide. The biological sample can be processed in any of a 
variety of ways prior to being placed m contact with the antibody. 

Many detection methods typically involve the use of a labeled secondary 
antibody that recognizes the prunaiy antibod>^ (i.e., the antibody that bmds to the 
25 polypq)tide being detected). Depending upon the nature of the sample, appropriate 
methods include, but are not limited to, immunohistochemistiyj radioimmunoassay, 
ELISA, immunoblotting, and FACS analysis. Assays for an activity such as GTP 
bmding or hydrolysis may also be employed. In the case where the polypeptide is to 
be detected in a tissue sample, e.g., a biopsy sample, immunohistochemistry may be a 
30 particularly appropriate detection method. Techniques for obtaining tissue and cell 
samples and performing immunohistochemistry and FACS are well known in the art. 
Such techniques are routinely used, for example, to detect the ER in breast tumor 
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tissue or cell samples. In general, such tests will include a negative control, which can 
involve applying the test to normal tissue so that the signal obtained thereby can be 
compared with the signal obtained from the sample being tested. In tests in which a 
secondary antibody is used to detect the antibody that binds to the polypeptide of 
5 interest, an appropriate negative control can uivolve perfonning the test on a portion 
of the sample with the omission of the antibody that binds to the polypeptide to be 
detected, i.e., with the omission of the primaiy antibody. Antibodies suitable for use 
as diagnostics generally exhibit high specificity for the target polypeptide and low 
background. In general, monoclonal antibodies are preferred for diagnostic purposes. 

10 The invention encompasses the use of protein arrays, mcluding antibody arrays, for 
detection of the polypeptide. The use of antibody arrays is described, for example, in 
Haab, B., et al, *'Protein microarrays for highly parallel detection and quantitation of 
specific proteins and antibodies in complex solutions". Genome Biol. 2001 ;2(2), 
2001 . Other types of protein arrays are known in the art. 

IS In general, the results of such a test can be presented in any of a variety of 

formats. The results can be presented in a qualitative &shion. For example, the test 
report may indicate only whether or not a particular polypeptide was detected, perhaps 
also with an indication of the limits of detection. The results may be presented in a 
semi-quantitative fashion. For example, various ranges may be defined, and the 

20 ranges may be assigned a score (e.g., 1+ to 4+) that provides a certain degree of 

quantitative information. Such a score may reflect various factors, e.g., the number of 
cells m which the polypeptide is detected, the intensity of the signal (which may 
mdicate the level of expression of the polypeptide), etc. The results may be presented 
. in a quantitative fashion, e.g., as a percentage of cells in which the polypeptide is 

25 detected, as a protein concentration, etc. As will be appreciated by one of ordinary 
skill m the art, the type of output provided by a test will vary depending upon the 
technical limitations of the test and the biological significance associated with 
detection of the polypeptide. For example, in the case of certain polypeptides a purely 
qualitative output (e.g., whether or not the polypeptide is detected at a certain 

30 detection level) provides significant information. In other cases a more quantitative 
output (e.g., a ratio of the level of expression of the polypeptide in the sample bemg 
tested versus the normal level) is necessary. 
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A particular use for antibodies that bind to BSTP-Ras/Rerg is to classify breast 

w^t.rvwii uiw WA^iC^ddiUll ICVCI Ul lliC £>02 'KAC^/Kerg 

gene, or the association between a gene subset that includes the BST-RAS/Rerg gene, 
and a tumor subset having a particular phenotype (e.g., a good prognosis phenotype, a 
5 poor prognosis phenotype, a non-responder phenotype, etc.). Immunohistochemistiy 
using an antibody that binds to BSTP-Ras/Rerg can be employed to detect BSTP- 
Ras/Rerg in a tumor tissue or cell sample, thereby providing mformation useful in * 
determining whether the tumor expresses or overexpresses the polypeptide. 
Additional detection methods including RIA, ELISA, immunoblotting, and FACS 
1 0 analysis. Thus the present invention provides a test for classifying tumors. 

The result of such a test (e.g., whether a given tumor expresses or 
overexpresses BSTP-Ras/Rerg, quantitative expression level of BSTP-Ras/Rerg, etc.) 
can be used to provide information about the prognosis of the tumor or the likelihood 
that the tumor will respond to therapy. In certain embodiments of the inventive 
1 5 methods a single antibody is used whaeas in other embodiments of the invention 
multiple antibodies, directed either against the same or against different polypeptides 
can be used to increase the sensitivity or specificity of the test or to provide more 
detailed information than that provided by a single antibody. Thus flie invention 
encompasses the use of a battery of antibodies, one or more of which binds to BSTP- 
20 Ras/Rerg. 

Althougji in many cases detection of polypeptides using antibodies represents 
the most convenient means of determining whether BST-RAS/Rerg is expressed (or 
overexpressed) in a particular sample, the invention also encompasses the use of 
polynucleotides for this purpose. The mvention provides methods for detecting a 

25 polynucleotide encoding a polypeptide comprising an amino acid sequence set forth in 
SEQ ID N0:1, or a polypeptide having significant similarity to the polypeptide of 
SEQ ID N0:1, in a biological sample. One such method comprises steps of: (a) 
hybridizing a nucleic acid complementary to the polynucleotide encoding a 
polypeptide comprising an amino acid sequence set forth in SEQ ID N0:1, or a 

30 polypeptide having significant similarity to the polypeptide of SEQ ID N0:1, to at 
least one nucleic acid in the biological sample, thereby foiming a hybridization 
complex; and (b) detecting the hybridization complex, wherein the presence of the 
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hybridization complex indicates the presence of a polynucleotide encoding the 
polypeptide in the biological sample. A second such method comprises steps of: (a) 
hybridizing a nucleic acid.encoding a polypeptide comprising an amino acid sequence 
set forth in SEQ ID NO:l, or a polypeptide having significant similarity to the 
5 polypeptide of SEQ ID NO:l, to at least one nucleic acid complementary to a nucleic 
acid in the biological sample, thereby forming a hybridization complex; and (b) 
detecting the hybridization complex, wherein the presence of the hybridization 
complex indicates the presence of a polynucleotide encoding the polypeptide in the 
biological sample. In other words, detection can comprise detecting either a 

10 polynucleotide encoding a polypeptide comprising an amino acid sequence set forth m 
SEQ ID NO:l or a polypeptides having significant similarity to the polypeptide of 
SEQ ID N0:1, or detecting the complement of such a polynucleotide. For example, in 
certain embodiments of the inventive method mRNA in the biological sample is 
detected while in other embodiments of the invention cDNA synthesized from mRNA 

15 in the biological sample is detected. 

The hybridization complex can be detected in any of a variety of ways. For 
example, the hybridization complex may be formed on a microarray (e.g., a cDNA 
array or an oligonucleotide array) and detected using techniques such as those 
described herein or related techniques well known m the art. Microarray analysis is 

20 but one means by which polynucleotides can be used to detect or measure BST- 
RAS/Rerg expression. E>q)ression of BST-RAS/Rerg can also be measured by a 
variety of oth^ techniques that make use of a polynucleotide corre^onding to part or 
all of the BST-RAS/Rerg gene rather ttian an antibody that bmds to a polypeptide 
encoded by the gene. Appropriate techniques include, but are not limited to, in situ 

25 hybridization. Northern blot, and various nucleic acid amplification techniques such 
as PGR, quantitative PGR, and the ligase chain reaction. Various additional 
techniques for detecting nucleic acids may be employed in the practice of the 
invention. See, for example, U.S. Patent No. 6,057,105. 

Another aspect of the invention comprises a kit to test for the presence of any 

30 of the inventive polynucleotides or polypeptides, e.g., in a cell or tissue sample or in a 
body fluid. The kit can comprise, for example, an antibody for detection of a 
polypeptide (e.g., the polypeptide of SEQ ID NO: 1 or a polypeptide having significant 
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similarity to the polypeptide of SEQ ID NO:l) or a probe for detection of a 
polynucleotide. In addition, tiie kit can comprise a reference sample, Insiirucliuns for 
processing samples, performing the test and interpreting the results, buffers and other 
reagents necessary for performing the test. In certain embodiments the kit can 
5 comprise a panel of antibodies. In certain embodiments the kit comprises a cDNA or 
oligonucleotide array for detection of expression of the gene encoding BSTP-Ras, 
e.g., for detecting the presence of a polynucleotide comprising the nucleotide 
sequence set forth in SEQ ID N0:2, SEQ ID NO:3, SEQ ID N0:4, or SEQ ID NO:5. 
Yet another aspect of the invention comprises detecting mutations in the gene 

10 encodmg BSTP-Ras/Rerg and/or m regulatory regions of the gene. The invention 
further encompasses detecting allelic variants of the gene encoding BSTP-Ras/Rerg. 
As mentioned above, mutations in certain genes (e.g., BRCA-I) have been 

associated with an increased risk of breast cancer. Mutations within regulatory 
regions and/or within the BSTP-Ras/Rerg coding sequence may affect expression 

IS levels or activity of BSTP-Ras/Rerg. Detection of such mutations may be used to 
provide prognostic information, diagnostic information, and/or to guide treatment 
decisions for breast cancer. For example, the fact that BST-RAS/Rerg is an estrogen- 
responsive gene (as demonstrated in Example 10) and tiie presence of two potential 
ER binding sites within the BST-RAS/Rerg promoter region suggest that expression of • 

20 BSTP-Ras/Rerg is regulated by die ER and that mutations in these sites that affect the 
ability of ER to bind may affect BSTP-Ras/Rerg expression. Thus fee invention 
provides methods for obtaining prognostic, diagnostic, or therapeutic information 
comprising (i) obtaming a sample containing cells or tissue fiom a subject, and (ii) 
detecting, wilfain the sample, a mutation in a regulatory or codmg region of the gene 

25 tiiat encodes BSTP-Ras/Rerg. In certain of the methods the mutation is in an ER 

bmding site. In certain of the methods the mutation is in a site that encodes a GTPase 
domain. Techniques for detecting mutations and allelic variants are well known in the 
art, and the inventive methods encompass the use of any appropriate technique. Such 
techniques include use of microarrays (e.g., oligonucleotide arrays) to detect single 

30 nucleotide polymorphisms (SNPs) associated with a particular allele, use of 
microarrays (e.g.,oligonucleotide arrays) to detect substitutions, deletions, etc.. 
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detection of restriction fragment length polymorphisms (RFLPs), direct sequencing 
of DNA isolated from an individual, etc. 

For example, to determtae whether there is a mutation in a particular region of 
a gene the region may be amplified using PGR and then sequenced. Alternately, the 
S region may be amplified using PGR and hybridized to an oligonucleotide array 

containing probes that represent different sequence variants as described, for example, 
in Hacia, J., "Resequencing and mutational analysis using oligonucleotide arrays'', 
Natvre Genetics, Supplement, 21, pp. 42-47, 1999, and references therein. Smce 
BSTP-Ras/Rerg is growth-inhibitory, mutations that result in decreased BSTP- 
10 Ras/Rerg activity, e.g., tumors whose cells harbor mutations that result in decreased 
GTP-binding activity or increased GTPase activity may be associated with a worse 
prognosis and/or more a aggressive phenotype than tumors whose cells do not harbor 
such mutations. 

15 C. Gompositions and Therapeutic Agents 

The invention includes, the use of the polynucleotides, polypeptides, and 
antibodies described herein as therapeutic agents for the treatment of cancer among 
other uses. In particular, the invention contemplates the use of the polynucleotides, 
polypeptides, and antibodies described herein for the treatment of breast cancer, 

20 although their use for the treatment of other forms of cancer is also within the scope of 
the invention. In addition to use as therapeutics, certain of the compositions are useful 
for altering cell phenotypes including cell growth properties. The mvention 
encompasses agonists and antagonists to BSTP-Ras^erg. Such agents (which 
mclude, but are not limited to, antibodies, small molecules, antisense nucleic acids) 

25 may be produced or identified using any of a variety of methods knovm in the art. For 
example, a purified inventive BSTP-Ras/Rerg polypeptide or fragment thereof may be 
used to raise antibodies or to screen libraries of compounds to identify those that 
specifically bind to the polypeptide. 

Preferably antibodies suitable for use as therapeutics exhibit high specificity 

30 for the target polypeptide and low background binding to other polypeptides. In 

general, monoclonal antibodies are preferred for therapeutic purposes. In the case of 
breast cancer, antibodies against the HER2/neu/ErbB2 polypeptide (a polypeptide 
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homologous to the epidennal growth factor receptor) represent a paradigm in terms of 
the development of tlierapeutic antibodies. The HER2/neu/ErbB2 gene is 
overexpressed in approximately 25 to 30 percent of metastatic breast tumore, and an 
antibody against the HER2/neu/ErbB2 polypeptide, Herceptin® (Tmstuzumab) is 
5 approved for the treatment of certain patients with metastatic breast cancer, 

confirmmg the utility of therapeutic antibodies dfrected agamst polypeptides that are 
specifically overexpressed in particular tumors subsets. Antibodies directed against a 
polypeptide expressed by a cell may have a number of mechanisms of action. In 
certain instances, e.g., in the case of a polypeptide that exerts a growth stimulatory 
10 effect on a cell, antibodies may directly antagonize the eflFect of the polypeptide and 
thereby arrest tumor progression, trigger apoptosis, etc. In certain embodimaits of the 
invention the antibody may serve to target a toxic moiety to the cell. Thus the 
invention encompasses the use of antibodies that have been conjugated with a 
cytotoxic agent, e.g., a toxin such as ricin or diphtheria toxm, a radioactive moiety, 
1 5 etc. Such antibodies can be used to direct tfie cytotoxic agent specifically to cells that 
express the mventive polypeptide. 

Although certain modulators may function through direct interaction with a 
polypeptide such as BSTP-Ras/Rerg, e.g., by stimulating or inhibiting its activity, 
others may function by affecting expression of tiie polypeptide. It may be desirable to 
20 increase the expression of a gene encoding BSTP-Ras/Rerg or to increase the activity 
of BSTP-Ras/Rerg. For example, in the case of genes (such as that encoding BSTP- 
Ras/Rerg), v^iiose expression correlates with that of the estrogen receptor and 
therefore is associated with a good prognosis and/or which exhibit growth-mhibitoiy 
eflFeots, it may be desirable to increase the ejqjression of such genes or the activity of 
25 the coiresponding polypeptides in tumors that fail to express these genes or express 
them at low levels relative to normal cells. 

Modulators (e.g., inhibitors or activators) ofBST-RAS/Rerg gene expression 
are within the scope of the invention and may be detected by screening libraries of 
compounds usmg, for example cell lines that express the polypeptide or a version of 
30 the polypeptide that has been modified to include a readily detectable moiety. 
Methods for identifymg compounds capable of modulating gene expression are 
described, for example, in U.S. Patent No. 5,976,793. The screenmg methods 
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described therein are particularly appropriate for identifying compounds that do not 
naturally occur within cells and that modulate the expression of genes of interest 
whose expression is associated with a defined physiological or pathological effect 
within a multicellular organism. 
S The invention encompasses methods for identifying compounds tiiat modulate 

the activity of a polypeptide encoding BSTP-Ras/Rerg. Methods of screening for 
such interacting compounds are well known in the art and depend, to a certain degree, 
on the particular properties and activities of the polypeptide encoded by the gene. The 
invention provides methods for identifying a compound that increases the activity of 

1 0 BSTP-Ras/Rerg and methods for identifying a compound that decreases the activity of 
BSTP-Ras/Rerg. In addition, the invention provides conipounds that increase BSTP- 
Ras/Rerg activity and cotnpounds that decrease BSTP-Ras/Rerg activify. Tlie 
methods involve identifying a compound that enhances the expression or activity of a 
polypeptide comprising the amino acid sequence of SEQ ID NO:l or of a polypeptide 

1 5 having significant similarity to the polypeptide of SEQ ID NO: 1 . According to one 
embodiment, the method includes steps of: (1) contacting a polypeptide comprising 
the amino acid sequence of SEQ ID N0:1 or a polypeptide having significant 
similarity to the polypeptide of SEQ ID NO: 1 with a test compound; (2) measuring 
GTP-binding and/or GTPase activity of the polypeptide; and (3) identifying the test 

20 compound as an activator of the polypeptide if the polypeptide exhibits enhanced 
GTP-binding activity and/or reduced GTPase activity in the presence of the test 
compound relative to the level of GTP-binding activity and/or GTPase activity in the 
absence of the test compound. In certain embodimrats of the method the polypeptide 
includes a GTP-binding domain. Hie method may be performed in vitro^ e.g., using a 

23 BSTP-Ras/Rerg polypeptide or significantly similar polypeptide or fingment th^eof 
produced using recombinant DNA technology, or may be performed in vivo, e.g., by 
applying the test compound to a cell that expresses the polypeptide and isolating the 
polypeptide from the cell. In certain of the metfiods the polypeptide may be a variant 
that includes a tag (e.g., an HA tag, 6XHis tag, Flag tag, etc.) which may be used, for 

30 example, to facilitate isolation or the variant may be a fusion protein. Any suitable 
assay for OTP binding and/or GTPase activity may be used mcliiding, but not limited 
to, those described above. Suitable assays are described, for example, in Self AJ, Hall 
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A, "Measurement of intrinsic nucleotide exchange and GTP hydrolysis rates" in 

h/fathrtrlv 7?-fi'nn*t^1 O^A.'AI^IA lOO^ 

Compounds suitable for screening according to the above methods include 
small molecules, natural products, peptides, nucleic acids, etc. Sources for 
5 compounds include natural product extracts, collections of synthetic compounds, and 
compound libraries generated by combinatorial chemistry. Libraries of compounds 
are well known m the art. One representative example is known as DIVERSet™, 
available from ChemBridge Corporation, 1 698 1 Via Tazon, Suite G, San Diego, CA 
92127. DIVERSet"'^ contains between 10,000 and 50,000 drug-like, hand- 

10 synthesized small molecules. The compounds are pre-selected to form a "universal" 
library that covers the maximum pharmacophore diversity with the minimum number 
of compounds and is suitable for either high throughput or lower throughput 
screening. For descriptions of additional libraries, see, for example, Tan, et al, 
"Stereoselective Synthesis of Over Two Million Compounds Having Structural 

15 Features Both Reminiscent of Natural Products and Compatible with MSniaturized 
Cell-Based Assays**, C/ieiw&c. 120, 8565-8566, 1998; Floyd CD, Leblanc C, 
Whittaker M, Prog Med Chem 36:91-168, 1999. Numerous libraries are commercially 
available, e.g., from AnalytiCon USA Inc., P.O. Box 5926, Kingwood, TX 77325; 3- 
Dimensional Pharmaceuticals, Inc., 665 Stockton Drive, Suite 104, Exton, PA 19341- 

20 1151; Tripos, Inc., 1699 Hanley Rd., St. Louis, MO, 63144-2913, etc. In certam 

embodiments of the mvention the methods are performed in a high-throughput format 
using techniques that are well known in flie art, e.g., in multiwell plates, using 
robotics for sample preparation and dispensing, etc. Representative examples of 
various screenmg methods may be found, for example, in U.S. Patent No. 5,985,829, 

25 U.S. Patent No. 5,726,025, U.S. Patent No. 5,972,621, and U.S. Patent No. 6,015,692. 
The skilled practitioner will readily be able to modify and adapt these methods as 
appropriate for BSTP-Ras/Reig. 

The invention includes compounds identified using the above methods, i.e., 
compounds that increase or decrease one or more activities of BSTP-Ras/Rerg or that 

30 increase or decrease a polypeptide having significant similarity to the polypeptide of 
SEQIDNOiL 
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Reduction in expression of an endogenously produced polypeptide may be 
achieved by the administration of antisense nucleic acids (e.g., oligonucleotides, 
RNA, DNA, most typically oligonucleotides that have been modified to improve 
stability or targeting) or peptide nucleic acids comprising sequences complementary 
5 to those of the mRNA that encodes the polypeptide. Antisense technology and its 
applications are described in Phillips, MJ. (ed.) Antisense recA/io/ogy, Methods 
EnzymoL, Volumes 313 and 314, Academic Press, San Diego, 2000, and references 
mentioned therein. Ribozymes (catalytic RNA molecules that are capable of cleaving 
other RNA molecules) represent another approach to reducmg gene expression. Such 

1 0 ribozymes can be designed to cleave specific mRNAs corresponding to a gene of 
mterest Their use is described in U.S. Patent No. 5,972,621, and references therein. 
The invention encompasses the delivery of antisense and/or ribozyme molecules via a 
gene therapy approach in which vectors or cells expressing the antisense molecules 
are administered to an individual. 

IS The invention also encompasses the use of polynucleotides encoding BSTP- 

Ras, or fi^gments thereof as DNA vaccines. Such vaccines comprise polynucleotide 
sequences, typically inserted into vectors, that direct the expression of an antigenic 
polypeptide within the body of the individual being immunized. Details regarding the . 
development of vaccines, includmg DNA vaccines, for various forms of cancer may 

20 be found, for example, in Brinckerhoff L.H-, Thompson L.W., Slingluff C.L., Jr., 
Melanoma Vaccines, Ctirr Opin Oncol, 12(2):163-73, 2000 and m Stevenson, FJC., 
DNA vaccines against cancer: fit>m genes to therapy, v^/i. OncoL, 10(12): 1413-8, 
1999 and references therein. BSTP-Ras/Rerg polypeptides, or firagments diereo^ that 
may also find use as cancer vaccines. Any of these vaccines may be used for the 

25 prevention and/or the treatment of cancer. 

The invention includes pharmaceutical compositions comprising the inventive 
polypeptides, polynucleotides, antibodies, inhibitors, agonists, or antagonists 
described above. In general, a pharmaceutical composition will include an active 
agent in addition to one or more inactive agents such as a sterile, biocompatible 

30 carrier including, but not lunited to, sterile water, saline, buffered saline, or dextrose 
solution. The pharmaceutical compositions may be admmistered eiflier alone or in 
combination with other therapeutic agents including other chemotherapeutic agents, 
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hormones, vaccines, and/or radiation therapy. By "in combination with", it is not 

V ••»**** "*v (AgwiiLa AAAUov t/w uioaiiiiiidtwi at UlC aailiC lUllC U! lUIUIUlttlCCl 

for delivery together, although these methods of delivery are within the scope of the 
invention. In general, each agent will be administered at a dose and on a time 
schedule determined for that agent. Additionally, the mvention encompasses the 
delivery of the inventive pharmaceutical compositions in combination with agents that 
may unprove their bioavailability, reduce or modify their metabolism, inhibit their 
excretion, or modify their distribution within the body. The invention encompasses 
treatmg cancer, particularly breast cancer, by administering the pharmaceutical 
compositions of the invention. Although the pharmaceutical compositions of the 
present invention can be used for treatment of any subject (e.g., any animal) in need 
thereof, they are most preferably used in the treatment of humans. 

The pharmaceutical compositions of this invention can be administered to 
humans and oliier anhnals by a variety of routes including oral, intravenous, 
intramuscular, intraarterial, subcutaneous, intraventricular, transdermal, rectal 
intravaginal, mtraperitoneal, topical (as by powders, ointments, or drops), bucal, or as 
an oral or nasal spray or aerosol. In general the most appropriate route of 
administration will depend upon a variety of factors including the nature of the 
compound (e.g., its stability in the environment of the gastrointestinal tract), the 
condition of the patient (e.g., whether the patient is able to tolerate oral 
administration), etc. At present the intravenous route is most commonly used to 
deliver ther^eutic antibodies and nucleic acids. However, the invention encompasses 
the delivery of the inventive pharmaceutical composition by any appropriate route 
takmg mto consideration likely advances in the sciences of drug delivery. 

General considerations in the formulation and maniifacture of pharmaceutical 
agents may be found, for example, in Remington 's Pharmaceutical Sciences, 19* ed., 
Mack Publishmg Co., Easton, PA, 1995. It will be appreciated Aat certain of the 
compounds of the present invention can ©cist in free form for treatment, or, where 
^propriate, in salt form, as discussed in more detail below. Compounds to be 
utilized in the pharmaceutical compositions include compounds ^sting in free form 
or pharmaceutically acceptable derivatives thereof, as defined herein, such as 
pharmaceutically acceptable salts, esters, salts of such esters, or any other adduct or 
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derivative, whicli upon administration to a patient in need, is capable of providing, 
directly or indirectly, a compound as otherwise described herein, or a metabolite or 
residue thereof e.g., a prodrug. Thus, as used herein, the term "phamiaceutically 
acceptable salt" refers to those salts which are, within the scope of sound medical 
5 judgment, suitable for use in contact with the tissues of humans and lower animals 
without undue toxicity, irritation, allergic response and the like, and are 
commensurate with a reasonable benefit/risk ratio. Pharmaceutically acceptable salts 
are well known in the art. For example, S. M. Berge, et al. describe pharmaceutically 
acceptable salts in detail in J, Pharmaceutical Sciences^ 66: 1-19 (1977), incorporated 

10 herein by reference. The salts can be prepared m situ durmg the final isolation and 
purification of the compounds of the invention, or separately by reacting the fi-ee base 
ftmction with a suitable organic acid. Examples of pharmaceutically acceptable, 
nontoxic acid addition salts are salts of an amino group formed with inorganic acids 
such as hydrochloric acid, hydrobromic acid, phosphoric acid, sulfiiric acid and 

IS perchloric acid or widi organic acids such as acetic acid, oxalic acid, maleic acid, 

tartaric acid, citric acid, succinic acid, or malonic acid or by using oth^ methods used 
in the art such as ion exchange. Other pharmaceutically acceptable salts include 
adipate, alginate, ascorbate, aspartate, benzenesulfonate, benzoate, bisulfate, borate, 
butyrate, camphorate, camphorsulfonate, citrate, cyclopentanepropionate, digluconate, 

20 dodecylsulfite, ethanesulfonate, formate, flunarate, glucoheptonate, glycerophosphate, 
gluconate, hemisulfate, heptanoate, hexanoate, hydroiodide, 2-hydroxy- 
ethanesulfonate, lactobionate, lactate, laurate, lauryl sulfate, malate, maleate, 
malonate, metfaanesulfonate, 2-naphthalenesulfonate, nicotinate, nitrate, oleate, 
oxalate, pahnitate, pamoate, pectinate, persulfate, 3-phenylpropionate, phosphate, 

25 picrate, pivalate, propionate, stearate, succinate, sul&te, tartrate, thiocyanate, p- 
toluenesulfonate, undecanoate, valerate salts, and the like. Representative alkali or 
alkaline earth metal salts include sodium, lithium, potassium, calcium, magnesium, 
and the like. Further pharmaceutically acceptable salts uiclude, when appropriate, 
nontoxic ammonium, quaternary ammonium, and amine cations formed using 

30 counterions such as halide, hydroxide, carboxylate, sulfate, phosphate, nitrate, lower 
alkyl sulfonate and aryl sulfonate. 

Additionally, as used herein, the term "pharmaceutically acceptable ester" 
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refers to esters that hydrolyze in vivo and include those that break down readily in the 
human body to leave the parent compound or a salt thereof. Suitable ester groups 
include, for example, those derived from pharmaceutically acceptable aliphatic 
carboxylic acids, particularly alkanoic, alkenoic, cycloalkanoic and alkanedioic acids, 
5 in which each alkyl or alkenyl moiety advantageously has not more than 6 carbon 
atoms. Examples of particular suitable esters includes formates, acetates, propionates, 
butyrates, aciylates and ethylsuccinates. 

Furthermore, the term "pharmaceutically acceptable prodrugs" as used h^m 
refers to those prodrugs of the compounds of the present invention that are, within the 

1 0 scope of sound medical judgment, suitable for use in contact with the tissues of 
humans and lower animals without imdue toxicity, kritation, allergic response, and 
the like, commensurate with a reasonable benefit/risk ratio, and effective for their • 
int^ded use, as well as the zwitterionic forms, where possible, of the compounds of 
the invention. The term "prodrug" refers to compounds that are rapidly transformed in 

15 vivo to yield a particular active compound, for example by hydrolysis in blood. A 
thorough discussion is provided in T. Higuchi and V. Stella, ^Tro-drugs as Novel 
Delivery Systems", Vol. 14 of the A.C.S. Symposium Series, and in Edward B. 
Roche, ed., Bioreversible Carriers in Drug Design^ American Pharmaceutical 
Association and Pergamon Press, 1987, both of which are incorporated herein by 

20 reference. 

As mentioned above, the phamiaceutical compositions of the present invention 
additionally comprise a pharmaceutically acceptable carrier, which, as used herein, 
means a non-toxic, inert solid, semi-solid or liquid filler, diluent, encapsulating 
material, or formulation auxiliary of any type. Some examples of materials which can 

25 serve as pharmaceutically acceptable cairiers are sugars such as lactose, glucose and 
sucrose; starches such as com starch and potato starch: cellulose and its derivatives 
such as sodium carboxymethyl cellulose, etiiyl cellulose and cellulose acetate; 
powdered tragacanth; malt; gelatin; talc; excipients such as cocoa butter and 
suppository waxes; oilis such as peanut oil, cottonseed oil; safflower oil; sesame oil; 

30 olive oil; com oil and soybean oil; glycols; such a propylene glycol; esters such as 
ethyl oleate and ethyl laurate; agar; buffering agents such as magnesium hydroxide 
and aluminum hydroxide; alginic acid; water; isotonic saline; Ringer's solution; ethyl 
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alcohol, and phosphate buffer solutions, dextrose solutions, as well as other non-toxic 
compatible lubricants such as sodium lauiyl sulfate and magnesium stearate, as well 
as coloring agents, releasing agents, coating agents, sweetening, flavoring and 
perfuming agents, preservatives and antioxidants can also be present m the 
5 composition, according to the judgment of the formulator. 

Liquid dosage forms for oral admmistration mclude pharmaceutically 
acceptable emulsions, microemulsions, solutions, suspensions, syrups and elbcirs. In 
addition to the active compounds, the liquid dosage forms may contain inert diluents 
commonly. used in the art such as, for example, water or other solvents, solubilizing 

10 agents and emulsifiers such as ethyl alcohol, isopropyl alcohol, ethyl carbonate, ethyl 
acetate, benzyl alcohol, benzyl benzoate, propylene glycol, 1,3-butylene glycol, 
dimethylformamide, oils (in particular, cottonseed, groundnut, com, germ, olive, 
castor, and sesame oils), glycerol, tetrahydrofiirfuryl alcohol, polyethylene glycols 
and fatty acid esters of sorbitan, and mixtures thereof. Besides inert diluents, the oral 

15 compositions can also include adjuvants such as wetting agents, emulsifying and 
suspending agents, sweetening, flavoring, and perfuming agents. 

Injectable preparations, for example, sterile injectable aqueous or oleaginous 
suspensions may be formulated according to the known art using suitable dispersing 
or wetting agents and suspending agents. The sterile injectable preparation may also 

20 be a sterile injectable solution, suspension or emulsion in a nontoxic parenterally 

acceptable diluent or solvent, for example, as a solution in 1,3-butanediol. Among the 
acceptable vehicles and solvents that may be employed are water. Ringer's solution, 
U.S.P. and isotonic sodium chloride solution. In addition, sterile, fixed oils are 
conventionally employed as a solvent or suspenduig medium. For this purpose any 

25 bland fixed oil can be employed includuig synthetic mono- or diglycerides. In 
addition, fatty acids such as oleic acid are used in the preparation of injectables. 

The injectable formulations can be sterilized, for example, by filtration 
through a bacterial-retaining filter, or by incorporating sterilizing agents in the form of 
sterile solid compositions which can be dissolved or dispersed in sterile water or other 

30 sterile injectable medium prior to use. 

In order to prolong the effect of a drug, it is often desirable to slow the 
absorption of the drug from subcutaneous or intramuscular injection. This may be 
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accomplished by the use of a liquid suspension of crystalline or amorphous material 
With poor water solubility. The rate of absofption of tlie drug then depends upon its 
rate of dissolution which, in turn, may depend upon crystal size and crystalline form. 
Alternatively, delayed absorption of a parenterally administered drug form is 
accomplished by dissolving or suspending the drug in an oil vehicle. Injectable depot 
forms are made by forming microencapsulated matrices of the drug in biodegradable 
polymers such as polylactide-polyglycolide. Depending upon the ratio of drug to 
polymer and the nature of the particular polymer employed, the rate of drug release 
can be controlled. Examples of other biodegradable polymers include 
poly(orthoesters) and poly(anhydrides). Depot mjectable formulations are also 
prepared by entrapping the drug in liposomes or microemulsions which are 
compatible wift body tissues. 

Compositions for rectal or vaginal administration are preferably suppositories 
which can be prepared by mbdng the compounds of this mvention with suitable non- 
uritatmg excipients or carriers such as cocoa butter, polyethylene glycol or a 
suppository wax which are solid at ambient temperature but liquid at body 
temperature and therefore melt m tiie rectum or vagmal cavity and release the active 
compound. 

Solid dosage forms for oral administration include capsules, tablets, pills, 
powders, and granules. In such solid dosage forms, the active compound is mixed 
with at least one inert, pharmaceutically acceptable excipient or carrier such as sodium 
citrate or dicalcium phosphate and/or a) fillers or extenders such as starches, lactose, 
sucrose, glucose, mannitol, and silicic acid, b) binders such as, for example, 
carboxymethylcellulose, alginates, gelatin, polyvinylpyrrolidinone, sucrose, and 
acacia, c) humectants such as glycerol, d) disintegrating agents such as agar-agar, 
calcium carbonate, potato or t^ioca starch, alginic acid, certain silicates, and sodium 
carbonate e) solution retarding agents such as paraflBn, f) absorption accelerators such 
as quaternary anunonium compounds, g) wetting agents such as, for example, cetyl 
alcohol and glycerol monostearate, h) absorbents such as kaolin and bentonite clay, 
and i) lubricants such as talc, calcium stearate, magnesiimi stearate, solid polyethylene 
glycols, sodium lauryl sulfate, and mixtures thereof. In the case of capsules, tablets 
and pills, the dosage form may also comprise buffering agents. 
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Solid compositions of a similar type may also be employed as fillers in soft 
and hard-filled gelatin capsules using such excipients as lactose or milk sugar as well 
as high molecular weight polyethylene glycols and the like. The solid dosage forms of 
tablets, dragees, ciq)sules, pills, and granules can be prepared with coatings and shells 
5 such as enteric coatmgs and other coatings well known in tiie pharmaceutical 

formulating art They may optionally contain opacifying agents and can also be of a 
composition that they release the active ingredient(s) only, or preferentially, in a 
certain part of the mtestinal tract, optionally, in a delayed manner. Examples of 
embedding compositions that can be used include polymeric substances and waxes. 

10 Solid compositions of a similar type may also be employed as fillers m soft and hard- 
filled gelatin capsules using such excipients as lactose or milk sugar as well as high 
molecular weight polethylene glycols and the like. 

The active compounds can also be in micro-encapsulated fomi with one or 
more excipients as noted above. The solid dosage forms of tablets, dragees, capsules, 

15 pills, and granules can be prepared with coatings and shells such as enteric coatmgs, 
release controlling coatings, and other coatings well knovm in the pharmaceutical 
formulating art. In such solid dosage forms the active compound may be admixed 
with at least one inert diluent such as sucrose, lactose or starch. Such dosage forms 
may also comprise, as is normal practice, additional substances otiier than inert 

20 diluents, e.g., tableting lubricants and other tableting aids such a magnesium stearate 
and microciystalline cellulose. In the case of capsules, tablets and pills, the dosage 
forms may also comprise buffering agents. They may optionally contain opacifying 
agents and can also be of a composition that they release the active ingredient(s) only, 
or preferentially, in a certam part of the mtestinal tract, optionally, in a delayed 

25 manner. Examples of embedding compositions that can be used include polymeric 
substances and waxes. 

Dosage forms for topical or transdermal administration of a compound of this 
invention include ointments, pastes, creams, lotions, gels, powders, solutions, sprays, 
inhalants or patches. The active component is admixed under sterile conditions with a 

30 pharmaceutically acceptable carrier and any needed preservatives or buffers as may be 
required. Ophthalmic formulation and ear drops are also contemplated as being within 
the scope of this invention. The ointments, pastes, creams and gels may contain, in 
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addition to an active compound of this invention, excipients such as animal and 

xwuj, una, woA^ca, |iaiaixiii5, 5icuuii, iragdcanin, ceiiuiose aenvaiives, 
polyethylene glycols, silicones, bentonites, silicic acid, talc and zinc oxide, or 
mixtures thereof. Powders and sprays can contain, in addition to the compounds of 
5 this invention, excipients such as lactose, talc, silicic acid, aluminum hydroxide, 
calcium silicates and polyamide powder, or mixtures of these substances. Sprays can 
additionally contain propellants known in the art such as chlorofluorohydrocarbons. 

Transdermal patches have the added advantage of providing controlled 
delivery of a compound to the body. Such dosage forms can be made by dissolving or 
10 dispensing the compound in the proper medium. Absorption enhancers can also be 
used to increase the flux of the compound across the skin. The rate can be controlled 
by either providing a rate controlling membrane or by dispersing tfie compound in a 
polymer matrk or gel. 

In yet another aspect, the present invention also provides a pharmaceutical 
15 pack or kit comprising one or more containers filled with one or more of the 
ingredients of the pharmaceutical compositions of the invention, and in certain 
embodiments, includes an additional approved therapeutic agent for use as a 
combmation therapy. Optionally associated with such container(s) can be a notice in 
the form prescribed by a governmental agency regulating the manu&cture, use or sale 
of pharmaceutical products, which notice reflects approval by the agency of 
manufecture, use or sale for human administration. Instructions for use of the 
compound(s) may also be included. 

According to the methods of treatment of the present invention, cancer, 
particularly breast cancer, is treated or prevented in a patient such as a human or other 
mammal by administering to the patient a therapeutically effective amount of a 
compound of the invention, fai such amounts and for such time as is necessary to 
achieve the desired result. By a "therapeutically effective amount" of a compound of 
the mvention is meant a sufficient amount of the compound to treat (e.g. to ameliorate 
the symptoms o^ delay progression of, prevent recurrence of, cure, etc.) cancer, 
particularly breast cancer, at a reasonable benefit/risk ratio, which involves a 
balancing of the efficacy and toxicity of the compound. In general, therapeutic 
efficacy and toxicity may be determined by standanl pharmacological procedures m 
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cell cultures or with experimental animals, e.g., by calculating the ED50 (the dose that 
is therapeutically effective in 50% of the treated subjects) and the LD50 (^^ dose that 
is lethal to 50% of treated subjects). The EDif/LDso represents the therapeutic index 
of the compound. Although in general drugs having a large therapeutic index are 
5 preferred, as is well known in the art, a smaller therapeutic index may be acceptable in 
the case of a serious disease, particularly in the absence of alternative therapeutic 
options. Ultimate selection of an appropriate range of doses for administration to 
humans is determined in tiie course of clinical trials. 

It will be understood that the total daily usage of the compounds and 

10 compositions of the present invention for any given patient will be decided by tiie 
attending physician within the scope of sound medical judgment. The specific 
therapeutically effective dose level for any particular patient will depend upon a 
variety of fectors including the disorder being treated and the severity of the disorder; 
the activity of the specific compound employed; the specific composition employed; 

1 S the age, body weight, general health, sex and diet of the patient; the time of 

administration, route of administration, and rate of excretion of the specific compound 
employed; the duration of the treatment; drugs used in combination or coincidental 
with the specific compound employed; and like &ctors well known in the medical 
arts. 

20 The total daily dose of the compounds of this invention administered to a 

human or other mammal in single or in divided doses can be in amounts, for example, 
fiom 0.01 to 50 mg/kg body weight or more usually fi-om 0.1 to 25 mg/kg body 
weight Single dose compositions may contain such amounts or submultiples thereof 
to make up the daily dose. In general, treatment regimens according to the present 

25 mvention comprise administration to a patient in need of such treatment fi'om about 
0. 1 fig to about 2000 mg of the compound(s) of the mvention per day in single or 
multiple doses. 
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EXAMPLES 

Note: A numbered iist of references for tfie Examples appears following the 
Examples, all of which are incorporated herem by reference. 

Example I 

Preparation of Microarrays Containing 8498 Human cDNAs 

The human cDNA clones used in this study were obtmned from Research Genetics 
(Huntsville AB, USA) as bacterial colonies in 96-weIl microtiter plates. The clones 
were chosen from a set of 15,000 cDNA clones that conesponded to the Research 
Genetics Human Gene Filters sets GF200-202 (http://www.resgen>com/^ . These 
clones form part of a set of clones assembled by the LM.A.G.E. consortium (Lennon, 
G.G., Aufifray, C, Polymeropoulos, M., Scares, M.B. The LM.A.G.E. Consortium: 
An Integrated Molecular Analysis of Genomes and their Expression. Genomics 
33:151-152,1996) and are identified by LMj\.G£. clone ID numbers. All clones 
printed on these arrays were sequence validated as part of a product offered at 
Research Genetics, Inc. We estunate ftat greater than 97% of the clones on the array 
are correctly identified. 

A detailed protocol for flie production of the cDNA microarrays used m this 
study is available at http://cmgm.stanford.edu/pbix)wn/protocols Jitml and is 
reproduced below with insubstantial changes. As described below, flie protocol 
includes steps of (1) cleaning the glass slides onto which the DNAs (e.g., products of 
PGR reactions) are to be spotted; (2) spotting the DNAs onto the glass slides with an 
arrayer; (3) Post processing to prepare arrays containing spotted DNAs for 
hybridization. All procedures are done at room temperature and with double distilled 
water unless otherwise stated. Unless otherwise stated, in this Example and the 
following Examples, reagents are prepared according to protocols available in 
Maniatis, T., Sambrook, J. and Fritsch, E., Molecular Cloning: A Laboratory Manual 
(3 Volume Set), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 1989, the 
contents of which are herein incorporated by reference. 

Cleaning Slides 
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Use 30 slide racks in 350mL glass dishes 

1. Dissolve 50g of NaOH pellets into 150ml ddH20 

2. Add 200ml of 95% EtOH, stir until completely mixed 

3. If solution remains cloudy, add ddH20 until clear 
S 4. Pour solution into glass slide box. 

5. Drop in 30 slides in a metal rack. (Gold Seal slides, Cat. 3010) 

6. Let soak on an orbital shaker for at least two hours 

7. Rinse slides by transferring rack to slide dish filled with ddH20 

8. Repeat ddH20 rinses x3. If s important to remove all traces of the 
10 NaOH-eflianol. 

9. Prepare Poly-l-lysine solution: Use Sigma Poly-l-lysine solution. Cat. No. 

8920 

10. Add 70mL poly-Hysine to 280ml of water 

1 1 . Transfer slides to poly-l-lysine solution and let soak for 1 hour. 

IS 12. Remove excess liquid fix}m slides by spinning tiie rack of slides on microtiter 

plate 

carriers at SOOrpm. 

13. Dry slides at 40 degrees C for 5 minutes in a vacuum oven. 

14. Store slides in a closed box for at least two weeks prior to use. 

20 IS. Before printing arrays, check a sample slide to make sure if s hydrophobic 

(water should bead off it) but the lysine coatmg is not turning opaque. 

Arraying 

1 - Transfer PGR reactions to 96-well V-bottom tissue culture plates (Costar). 
2S Add 1/10 vol. 3M sodium acetate (pH S.2) and equal volume isopropanol. Store at -20 
C for a few hours. 

2. Centrifuge in Sorvall at 3500 RPM for 45 min. Rinse with 70% EtOH, 
centrifuge again and dry. 

2. Resuspend DNA in 12ul 3X SSC for a few hours and transfer to flexible 
30 U-bottom printing plates. 

4. Spot DNA onto poly-l-lysine slides with an arrayer. 
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Post processing 

1. Rehydraie airays by suspending slides over a dish of warm ddH20. (-'i 
minute) 

2. Snap-dry each array (DNA side up) on a lOOC hot plate for 3 seconds. 

5 3. UV cross-link DNA to the glass by using a Stratalinker set for 60 milliJoules 

4. Dissolve 5g of succinic anhydride (Aldrich) in 315mL of 
n-methyl-pyrrolidinone. 

5. To this, add 35mL of 0.2M NaBorate pH 8.0 (made by dissolving boric acid in 
water and adjustmg the pH witti NaOH), and stir until dissolved. 

10 6. Soak arrays in this solution for 15 minutes with shaking. 

7. Transfer arrays to 95C water baft for 2 minutes 

8. Quickly transfer arrjo^s to 95% EtOH for 1 minute. 

9. Remove excess liquid from slides by spinnmg the rack of slides on microtiter 

plate 

1 5 carriers at 500rpm. 

10. Arrays can be used immediately. 

Reagent Suppliers 

20 Microscope slides Goldseal brand. (Cat 3010) 

Poly-Hysine solution Sigma product number P8920 
Succmic Anhydride Aldrich product number 23,969-0 
N-Methyl-Pyrrolidinone Aldrich product number 32,863-4 

25 

Microarrays were prepared according to the above protocol using the 8498 
cDNA clones described above. All microarrays used in the experiments described 
herein were from a single print run batch of microarrays. 

30 Example 2 
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Cell Lines, Breast Tissue, and Breast Tumor Samples for Microairay Analysis and 
Preparation of mRNA Samples 

Common Reference Sample 
5 Each of tbe 84 experimental samples tested here was analyzed by a 

comparative hybridization, using a common reference RNA pool as a standard; this 
reference sample was composed of equal mixtures of mRNA isolated from 1 1 
established cell lines derived from human tissue (MCF7, Hs578T, OVCAR3, HepG2, 
NTERA2, MOLT4, RPMI-8226, NB4+ATRA, UACC.62, SW872, and Colo205: also 

10 see Table 2 for more details). The 1 1 cell Imes were all grown to 70-90% confluence 
in RPMI medium, contaming 10% Fetal Calf Serum and Penicillin/Streptomycin. The 
cells were harvested either by scrapmg or centrifiigation, quickly resuspended in RNA 
lysis buffer and mRNA prqjared using the FastTrack™ 2.0 mRNA Isolation Kit 
(Invitrogen, Carlsbad, CA) according to the manu&cturer's instructions. In each case, 

IS multiple individual mRNA preparations were collected for each cell line, which were 
then pooled togedier and analyzed via Northern analysis before final mixing to ensure 
the quality of the input mRNAs (e.g., to confirm that the mRNA exhibited a size 
distribution indicating that it was substantially nondegraded). The 1 1 mRNA samples 
were then mixed together in equal amounts, aliquoted in lOmM Tris (7.4), and stored 

20 at -80 C until use (2 micrograms of common reference sample was used per 
microarray hybridization and was always labeled usuig Cy3). 

Normal Breast Tissue 

Three samples of normal breast tissue were analyzed. Two of the samples 
25 were obtained from Clontech (Palo Alto, CA) and were pools of six (Normall) or two 
(Nonnal2) whole normal breasts. The thiril sample (Normal3) was obtmned from a 
single individual. 

Breast Tumor Samples 
30 The 40 individual breast tumor samples were collected at either Stanford 

University in Stanford CA, USA, or in the Haukeland University Hospital in Bergen, 
Norway. Twenty of the forty breast tumors were sampled twice as part of a larger 
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Norwegian study on locally advanced breast cancers (T3/T4 and/or N2 tumors) and 
have been described previously (Aas, T., ct a!., Nat, Med, 2, 81 i-814, 1996, the 
contents of which are incorporated herein by reference) ; these patients underwent an 
open surgical biopsy before treatment with doxorubicin monotherapy (range 12-23 
5 weeks), followed by the definitive surgical resection of the remaining tumor after 
therapy, and were evaluated for clinical responses according to UICC criteria 
(Hayward, J., et al., Bk J. Cancer, 35, 292-298, 1977). In addition to the 20 pafrs, 
there were 8 additional "before" specknens from Norway and 10 tumor specimens 
from Stanford (all Stanford tumors tested had a diameter of 3 cm or larger). Finally, 2 
10 of aie 10 Stanford tumor specunens assayed were also paired with a lymph node 
metastasis from the same patient. 

mRNA Isolation from Breast Tumor and Tissue Samples 

Following their excision, breast tumor samples were rq)idly fix>z^ in liquid 
15 N2 and then stored at -80 C until use. mRNA was isolated from breast tumors and 
normal breast tissue using the Trizol Reagent (Gibco-BRL) and Invitrogen FasfTmck 
2.0 Kit (all Stanford samples, and see httpiZ/genome- 

www.stanford.edu/sbcmp/web.shtml for the detailed protocol) or usmg the Trizol 
Reagent followed by Dynal bead separation for the mRNA purification step (all 

20 Norway tissue samples). Briefly, frozen tumor samples were cut into small pieces and 
immediately placed mto 12 ml of Trizol Reagent Each tumor sample in Trizol was 
homogenized using a PowerGen 125 Tissue Homogenizer (Fisher Scientific), and 
total RNA was isolated according to the Trizol reagent manufecturer's protocol. 
Tumor mRNA was isolated according to the manu&ctur^'s protocols using the 

25 FastTrack 2.0 Kit (Invitrogen) or Dynal beads. 

Example 3 

Characterization of Breast Tissue and Tumor Samples 

30 
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For all but two of the tumor specimens (i.e. New Yoric 1 and New York 2), the 
mutational status of the TP53 gene was determined using published methods (Aas, T., 
etal). 

A single pathologist (applicant Matt van de Rijn) reviewed hematoxylin and 
5 eosin (H&E) sections of each tumor, including all before and after pairs, and made a 
histological evaluation of each while blinded to the source. Tiunors were graded using 
a modified version of the Bloom-Richardson method (Robbins, P., et al.. Hum Pathol^ 
26, 873-879, 199S). These data are displayed in Table 4. Repres^tative H&E 
sections of each tumor are posted on Applicants' website at bttp://genome- 

10 www.stanford.edu/molecularportraits/. 

Immunohistochemistry was performed as described previously (Perou, C, et 
a/., 1999; Bindl, J. and Wamke, R., Am J Clin Pathol, 85, 490-493, 1986, and 
Natkunam, Y., et al.. Am. J. Path., 156(1), 2000, the contents of which are 
incorporated herein by reference). The antibodies used included the conunercially 

1 5 available monoclonal antibodies CAMS .2 (specific for keratins 8/1 8, available from 
Becton Dickinson), anti-keratin 5/6 (available originally from Boehringer Mannheim, 
Indianapolis, IN, cat. no. 1273396 and now from Chemicon International, Temekula, 
CA ), anti-keratin 17 (clone E3, available from Dako, Carpinteria, CA, cat no. 
M7046), anti-CD3 (available from Dako), and anti-immunoglobulin light chain 

20 (A191,A193, available from Dako). These inmiunohistochemical methods were 
applied for all the immunohistochemical studies described in the present application 
unless otherwise stated. 

Example 4 

25 cDNA Synthesis and Labeling and Microarray Hybridization 

mRNA was isolated from breast tissue, breast tumor samples, and cell lines as 
described in Example 2. Fluorescently labeled cDNA was synthesized from the 
mKNA using a reverse transcriptase reaction that included dUTP labeled with either 
30 Cy3 or Cy5. For each hybridization experiment differentially labeled cDNA samples 
(an experimental sample and a reference sample) were pooled and hybridized to a 
cDNA microarray, which was then scanned as described in Example 4. The protocol 
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below provides details of the steps performed for cDNA synthesis and labeling and 
for uiicroaFfay hybfldl^ation. 

I . To set up for the reverse transcriptase (RT) reaction, combine the following (e.g., in 
5 an Eppendorf tube): 

(a) Anchored Oligo dT primer - 2 microliters at 2.5 micrograms/microliter or 
control - 2 microliters 

(b) mRNA - (whatever volume is needed to reach 1 .5-2 micrograms) 

(c) DEPC/H20 - add sufficient volume so that final volume is 16 microliters 
10 1. Heat at 70*^ C for 10 minutes 

3. Chill on ice for 1-2 mmutes 

4. Add the following RT reaction components to each individual tube: 

(a) 5X RT Buffer - 6 microliters 

(b) SOX dNTPs - 0.7 microliter - (500mm A,C,G, 200mm T) 
15 (c) Cy Dyes dUTP - 3 microliters - (either Cy3 or Cy5) 

(d) DTT Stock - 3 microliters - (comes with RT setup) 

(e) SupersCTipt H RT-1.7 microliters - (cat# 18064-014 Gibco-BRL) 

5. Mfacwell 

6. Incubate at 42** C for 1 hour 

20 7. Add another 1 microliter of Superscript 11 RT and mfac 

8. Incubate at 42** C for 1 more hour 

9. Degrade mRNA with 1 .5 microliters of IM NaOH / 2mM EDTA 

1 0. Incubate at 65^ C for 8 minutes (do NOT go TOO long here) 

I I. Add 15 microliters of O.IM HCL 

25 12. Add 450 microliters of TE (pH 7.4) to each sample and place each sample into a 
microcon-30 filter. 

13. Add 15 microliters of Human COTl DNA (Gibco-BRL = 1 microgram/microliter) 
to each sample in the microcon filter, 

14. Spin in Eppendorf centrifuge until volume equals about 50 microliters (8-10*) 
30 15. Remove flowthroughs, and pool Cy3 and Cy5 flowthroughs together for future 

recovery of Cy dyes (store at -20 ' C). 
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16. Invert microcons, recover labeled samples, and pool Cy3 and Cy5 samples 
together that will be used for an individual experiment, in a single microcon filter that 
was used in step 15. 

17. Add 500 microliters of T.E again, and spin xmtil final volume equals 8 microliters 
5 or less (BE VERY CAREFUL TO NOT SPIN THE SAMPLE DRY!!!) 

18. To the 8 microliter combined Cy3 + Cy5 sample, add the following: 

(a) Yeast tRNA - 1 microliter - (10 micrograms/microliter) 

(b) PolyA DNA - 2 microliters - (10 micrograms/microliter) 

(c) 20XSSC T 2 microliters - (FINAL SSC concentration ^proximately 3X) 
10 (d) 10% SDS - 03 microlitere 

FINAL VOLUME = 133 MICROLITERS 

19. Mix well. 

20. Heat sample at lOO*' C for 2 minutes, spin very briefly. 

21. Place samples at 4T' C for 20-30 miiiutes. 

15 22. During Step 21, prepare the necessary number of hybridization chambers (Custom 
made by Die-Tech, San Jose, CA (see "Drawings for custom parts at 
http://cmgm.stanford.edii/pbrown/mguide/HybChamber.pdf*) or purchased at Coming 
Costar, Acton, MA (CTM™ Hybridization Chamber, #2551), get 22mm X 22mm . 
coverslips ready, and get arrays ready. 

20 23. Add the 1 3 microliters of probe (i.e., labeled cDNA mixture) onto the center of the 
array while NOT actually touching the anray &ce with tfie pipette tip. 

24. Quickly and gently place the 22mm X 22mm glass#l coverslip onto the array 
&ce. 

25. Add about 15-20 microliters of 3XSSC in two drops onto the end of the array slide 
25 away from the actual array for hydration purposes. 

26. Assemble the hybridization chamber with the anay slide in it, and place into a 65 
C water bath overnight. 

27. Pull out the hybridization chamber and dry off the excess HjO. 

28. Disassemble the hybridization chamber, and quickly place the slides into a slide 
30 washing chamber that contains 2XSSC/0.05%SDS. Jiggle die slide holder up and 

down until the slide coverslip falls off. Repeat this individually for each array, one at 
a time, until all are done 
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29. Wash slides in IXSSC for 3-5 minutes. 

vvooii Diiuv^d ill w U.^yvoov^ lUl J-J U1I11UIC5, IW1C6. 

3 1 . Spin slides down in centrifuge at 200 RPM for 2 minutes. 
32.SCAN immediately. 

Example 5 

Collection, Processing, and Analysis of Data &om Microarray Hybridizations 

The cDNA microarrays were scanned with either a General Scanning 
(Watertown, MA) ScanAnray 3000 at 20 microns resolution, or with a prototype Axon 
Instruments (Foster City, CA) GenePix Scanner at 10 micron resolution. The output 
files, which were TIFF images, were then analyzed using the program ScanAlyze (M. 
Eisen; available at http://www.microarrays.orfi/software) . Fluorescent ratios and 
quantitative data on spot quality (see ScanAlyze manual) were stordd in a prototype of 
the AMAD database (M. Eisen; available at http;//www.microarravs.org/softwareV 
Areas of the array with obvious blemishes were manually flagged and excluded fi-om 
subsequent analyses. The primary data tables can be downloaded at http://genome- 
www.stanford.edu/molecularportraits/, in text/tab delimited format after obtaining a 
password. 

Data were extracted from the database in a single table, with each row 
representing an array element, each column a hybridization, and each cell the 
observed fluorescent ratio for the array element fai the appropriate hybridization. 
Previously flagged spots were excluded, as were spots that did not pass quality 
control. This table had 9216 rows and 84 columns. Array elements were removed if 
Ihey were not well measured in at least 80% of the hybridizations. The data table was 
split into tumors and cell lines, and the two subtables were separately median polished 
(the rows and columns were iteratively adjusted to have median 0) before being 
rejoined into a smgle table. Genes whose expression varied by at least 4-fold from the 
median m this sample set in at least three of the samples tested were selected for the 
analyses that led to the identification of polynucleotides encoding BSTP-Ras/Rerg 
(1753 genes satisfied these conditions). 
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Average-linkage hierarchical clustering, as implemented in the program 
Cluster (M. Eisen; http://www,microarrays,org/software) . was applied separately to 
both the genes and arrays. The results were analyzed, and figures generated, using 
TreeView (M. Eisen; http://www.microarravs.org/software) > 



Example 6 

Producing Antibodies to the BSTP-Ras/Rerg Polypeptide 

1 0 This example describes the preparation of a polyclonal antibody that binds to 

the BSTP-Ras polypeptide, i.e., an antibody that binds to a polypeptide comprising an 
ammo acid sequence as set forth in SEQ ID NO: 1. The example fiirtfaer describes 
affinity purification of the antibody. 

15 Materials 

• Anisole (Cat. No. A4405, Sigma) 

• 2,2'-azmo-di-(3-ethyl-benzdiiazoline-sulfon!c acid) (ABTS) (Cat. No. A6499, 
Molecular Probes Eugene, OR) 

• Activated Maleimide Keyhole Limpet Cyanin (Cat. No. 771 06, Pierce Ch^ical 
20 Go. Rockford, IL) 

• Biotin (Cat. No. B2643, Sigma) 

• Boric acid (Cat. No. B0252, Sigma) 

• Sepharose 4b (Cat. No. 17-0120-01; LKB/Pharmacia, Uppsala, Sweden) 

• Bovine Serum Albumin (LP) (Cat. No. 1 00 350, Boehringer Mannhehn, 
25 Indianapolis, IN) 

• Cyanogen bromide (Cat. No. C6388 Sigma, St Louis, MO) 

• Dialysis tubing Spectra/Por Membrane MWCO: 6-8,000 (Cat. No. 132 665, 
Spectrum Industries Inc., Laguna Hills, CA) 

• Dimethyl formamide (DMF) (Cat. No. 22705-6, Aldrich Chemical Company, 
30 Milwaukee, WI) 

• Die (Cat. No. BP 592-500, Fisher) 

• Ethanedithiol (Cat No. 39,802-0, Aldrich Chemicals, Milwaukee, WI) 
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• Ether (Cat. No. TX 1275-3, EM Sciences) 

• Eihylenediaminetetraacetaiic acid (EDTA)(Cat No. BP 120-1, Fisher Scientific, 
Springfield, NJ) 

• l-ethyl-3-(3'dimethylaminopropyl>carbodiimide, HCL (EDC) (Cat No. 341-006, 
Calbiochem, San Diego, CA) 

• Freund's Adjuvant, complete (Cat. No. M-0638-50B, Lee Laboratories, Grayson, 
GA) 

• Freund's Adjuvant, incomplete (Cat. No. M0639-50B, Lee Laboratories) 

• Fritted chromatogr^hy columns (Column part No. 12131011; Frit: Part No. 
12131029, Varian Sample Preparation Products, Harbor City, CA) 

• Gelatin fi^om Bovine Skin (Cat. No. G9382, Sigma) 

• Glycine (Cat. No. BP381-5, Fisher) 

• Goat anti-rabbit IgG, biotinylated (Cat No. A 041 8, Sigma) 

• HOBt (Cat. No. 01-62-0008, Calbiochem-Novabiochem) 

• Horseradish peroxidase (HRP) (Cat No. 814 393, Boehringer Mannheim) 

• HRP-Streptavidin (Cat No. S 5512, Sigma) 

• Hydrochloric Acid (Cat No. 71445-500, Fisher) 

• Hydrogen Peroxide 30% wAv (Cat No. H1009, Sigma) 
» Methanol (Cat No. A412-20, Fisher) 

> Microtiter plates, 96 well (Cat No. 2595, Coming-Costar Pleasanton, CA) 

» N-a-Fmoc protected ammo acids available from Calbiochem-Novabiochem, San 
Diego, CA. See 1997-1998 catalog pages 1-45. 

> N-a-Fmoc protected ammo acids attached to Wang Resin available from 
Calbiochem-Novabiochem. See 1997-1998 catalog pages 161-164. 

> NMP (Cat No. CAS 872-50-4, Burdick and Jackson, Muskegon, MI) 

> Peptide (Svntfaesized bv Research (renAHciQ 

- — —7 .wvM**.^ » riy 

> Piperidine (Cat No. 80640, Fluka, available through Sigma) 
' Sodium Bicarbonate (Cat No. BP328-1 , Fisher) 

Sodium Borate (Cat No. B9876, Sigma) 
Sodium Carbonate (Cat. No. BP357-1, Fisher) 
Sodium Chloride (Cat. No. BP 358-10, Fisher) 
Sodium Hydroxide (Cat No. SS 255-1, Fisher) 
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• Streptavidin (Cat. No. 1 520, Boehringer Mannheim) 

• Thioanisole (Cat. No. T-2765, Sigma) 

• Trifluoroacetic acid (Cat. No. TX 1275-3, EM Sciences) 

• Tween-20 (Cat. No. BP 337-500, Fisher) 

5 • Wetbox-(Rubbermaid Rectangular Servin' Saver™ Part No. 3862 Wooster, OH) 

Solutions 

• BBS - Borate Buffered Saline with EDTA dissolved in distilled water (pH 8.2 to 
8.4withHClorNaOH) 

10 -25 mM Sodium borate (Borax) 

-100 mM Boric Acid 
-75mMNaCI 
-SraMEDTA 

• 0.1 NHCl in saline 

15 -concentrated HCl (8.3 mL/0.917 L distilled water) 

-0.154 MNaCl 

• Glycine (pH 2.0 and pH 3.0) dissolved in distilled wat^ and adjusted to the 
desired pH. 

-0.1 M glycine 
20 -0.154 MNaCl 

• 5X Borate IX Sodium Chloride dissolved in distilled water. 

-0.11 MNaCl 

-60 mM Sodium Borate 

-250 mM Boric Acid 

25 • Substrate Buffer in distilled water adjusted to pH 4.0 with sodium hydroxide: 
-50 to 100 mM Citric Acid 

Peptide Synthesis Solutions 

• AA solution: HOBt is dissolved in NMP (8.8 grams HOBt to 1 liter NMP). 
30 Fmoc-N-a-amino at a concentration at .53 M. 

• Die solution: 1 part DIC to 3 parts NMP. 

• Deprotecting solution: 1 part Piperidine to 3 parts DMF 
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• Reagent R: 2 parts anisole, 3 parts ethanedithiol, 5 parts thioanisole, 90 parts 

tn£iuCrOaC0iiC aciu. 
Equipment 

.5 • MRX Plate Reader (Dynatech Inc., Chantilly,VA) 

• Hamilton Eclipse (Hamilton Instruments, Reno, NV) 

• Beckman TJ-6 Centrifuge, Refrigerated (Model No. TJ-6, Beckman Instruments, 
FuUerton, CA) 

• Chart Recorder (Recorder 1 Part No. 18-1001-40, Pharmacia LKB Biotechnology) 
10 • UV Monitor (Uvicord SH Part No. 18-1004-50, Phamiacia LKB Biotechnology) 

• Amicon Stirred Cell Concentrator (Model 8400, Amicon Inc., Beverly, MA) 

• 30 kD MW cutH)fFfilter (Cat No. YM-30 Membranes Cat. No. 13742, Amicon 
Inc., Beverly, MA) 

• Multi-channel Automated Pipettor (Cat No. 4880, Coming Costar Inc., 
IS Cambridge, MA) 

• pH Meter Coming 240 (Commg Science Products, Coming Glassworks, Coming, 
NY) 

• ACT396 peptide synthesizer (Advanced ChemTech, Louisville, KY) 

• Vacuum dryer (Box is from Labconco, Kansas City, MO; Pump is from Alcatel, 
20 Laurel MD). 

• Lyophilizer (Unitop 600sl m tandem with Freezemobile 12, both from Vfrtis, 
GardinCT, NY) 

Methods 

25 Peptides were selected usmg the program Omiga ™i . i (Oxford Molecular Group, 
Inc., 2105 So. Bascom Ave., Suite 200, Campbell, CA 95008) using the HoppAVoods 
method, which is described in Hopp TP, Woods KR, Mol Immunol Apr;20(4):483-9 
A computer program for predicting protein antigenic deteraiinants, 1983, and Hopp 
TP and Woods KR, Proc. Nat Acad ScL U.S.A. 78, 3824-3828, 1981. Three peptide 

30 sequences were selected. The sequences were selected from regions of the 

polypeptide that displayed minimal homology with known proteins. The sequences of 
the three peptides were as follows: 
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Peptide 1(SEQ ID NO 
Peptide 2 (SEQ ID NO: 
Peptide 3 (SEQ ID NO: 



MAKSAEVKLADFGRAGVGK 

VLPLKNILDEIKKPKN 

YELCREVRRRRMVQGKT 



Synthesis of Peptides 

Each of the three peptides listed above was synthesized according to the following 
protocol: 

Incubate: Resin was immersed in appropriate solution. All incubation steps occurred 
10 with mixing. 

Wash: Added 2 mis. DMF, incubated 5 minutes and dramed. 
Wash Cycle: Five washes. 

Machine Synthesis 

IS The sequence of the desired peptide was provided to the peptide synthesizer. The C- 
terminal residue was determined and the appropriate Wang Resin was attached to the 
reaction vessel. The peptides were synthesized C-terminus to N-terminus by adding 
one amino acid at a time using a synthesis cycle. Which ammo acid is added was 
controlled by the peptide synthesizer, which looks to the sequence of the peptide 

20 entered into its database. 

Step 1 - Resin Swelling: Added 2 mL DMF, incubated 30 minutes, drained DMF. 
Step 2 - Synthesis cycle 

2a - Deprotection: 1 mL deprotecting solution was added to the reaction 
25 vessel and incubated for 20 minutes. 

2b -Wash Cycle 

2c - Coupling: 750 mL of amino acid solution and 250 mL of DIC solution 
were added to the reaction vessel. The reaction vessel was incubated for 
thu^ minutes and washed once. The coupling step was repeated once. 
30 2d -Wash Cycle 

Step 2 was repeated over the length of the peptide. The amino acid solution changed 
as the sequence listed in peptide synthesizer dictated. 
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Step 3 - Final Deprotection: Steps 2a and 2b were performed one last time. 

Resins were deswelled in methanol— rinsed twice in 5 mL methanol, incubated 5 
minutes in 5 mL methanol, rinsed in 5 mL methanol — ^and then vacuum dried. 

5 

Peptide was removed from the resin by incubating 2 hours in reagent R and then 
precipitated mto ether. Peptide was washed in ether and then vacuum dried. Peptide 
was resolubilized in diH20, frozen, and lyophilized overnight 

10 Conjugation of Peptide with Keyhole Limpet Hemocvanin 

Peptide (6 mg) was dissolved in PBS (6 mL) and mixed with 6 mg of maleiimide 
activated KLH carrier m 6 mL of PBS for a total volume of 12 mL. The entire 
solution was mixed for two hours, dialyzed in IL PBS, and lyophilized. 

IS Inmiunization of Rabbits 

Two New Zealand White Rabbits were mjected with 250 jtg keyhole limpet 
hemocyanin (KLH) conjugated peptide in an equal volume of complete Freund's 
adjuvant and saline in a total volume of 1 mL. Antigens (KLH-Peptide, 100 each) 
in an equal volume of incomplete Freund's Adjuvant and saline were injected into 

20 three to four subcutaneous dorsal sites for a total volume of 1 mL two, four, and sfac 
weeks after the first immunization. The three peptides were injected together. 

The immunization schedule was as follows: 



Day 0 


Pre-immune bleed, primary immunization 


Day 15 


1st Boost 


Day 27 


1st Bleed 


Day 44 


2nd Boost 


Day 57 


2nd Bleed and 3rd Boost 


Day 69 


3rd Bleed 


Day 84 


4th boost 


Day 98 


4lh bleed 
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The Collection of Rabbit Serum 

The rabbits were bled (30 to 30 mL) from the auricular artery. The blood was allowed 
to clot at room temperature for IS minutes and the serum was separated from the clot 
5 using an lEC DPR-6000 centrifuge at SOOO x g. Cell-free serum was decanted gently 
into a clean test tube and stored at -20°C for afSnity purification. 

Determination of Antibody Titer 

All solutions with the exception of wash solution were added by the Hamilton 
10 Eclipse, a liquid handling dispenser. The antibody titer was determined in the rabbits 
using an ELISA assay with peptide on the solid phase. Flexible high binding ELISA 
plates were passively coated with peptide diluted in BBS (1 00 ^L, 1 ^gAvell) and the 
plate was incubated at 4^C in a wetbox overnight (air-tight container with moistened 
cotton balls). The plates were emptied and then washed three times with BBS 
IS containing 0.1% Tween-20 (BBS-TW) by repeated filling and emptymg using a semi- 
automated plate washer. The plates were blocked by completely filling each well with 
BBS-TW containmg 1% BSA and 0.1% gelatin (BBS-TW-BG) and incubating for 2 
hours at room temperature.- The plates were emptied and sera of both pre- and post- 
immune serum were added to wells. The first well contained sera at 1:S0 in BBS. 
20 The sera were then serially titrated eleven more times across the plate at a ratio of 1 : 1 
for a final (twelfth) dilution of 1 :204,800. The plates were incubated overnight at 
4*'C. The plates were emptied and washed three tunes as described. 

Biotuiylated goat anti-rabbit IgG (100 ^L) was added to each microtiter plate test well 
2S and incubated for four hours at room temperature. The plates were emptied and 

washed three times. Horseradish peroxidase-conjugated Streptavidin (100 diluted 
1:10,000 in BBS-TW-BG) was added to each well and incubate for two hours at 
room temperature. The plates were emptied and washed three times. TheABTSwas 
prepared fresh from stock by combining 10 mL of citrate buffer (0.1 M at pH 4.0), 0.2 
30 mL of the stock solution (15 mg/mL in water) and 10 |iL of 30% HjOj. The ABTS 
solution (lOO^L) was added to each well and incubated at room temperature. The 
plates were read at 414 X, 20 minutes following the addition of substrate. 
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The affinity column was prepared by conjugating 5 mg of peptide to 10 mL of 
cyanogen bromide-activated Sepharose 4B, and 5 mg of peptide to hydrazine- 
5 Sepharose 4B. Briefly, 100 uL of DMF was added to peptide (5 mg) and the mixture 
was vortexed until the contents were completely wetted. Water was then added (900 
fiL) and the contents were vortexed until the peptide dissolved. Half of the dissolved 
peptide (500 jiL) was added to separate tubes contaming 10 mL of cyanogen-bromide 
activated sepharose 4B in 0.1 mL of borate buffered saline at pH 8.4 (BBS), and 10 

1 0 mL of hydrazine-Sepharose 4B in 0. 1 M carbonate buffer adjusted to pH 4.5 using 
excess EDC in citrate buffer pH 6.0. The conjugation reactions were allowed to 
proceed overnight at room temperature. The conjugated sepharose was pooled and 
loaded onto fritted columns, washed with 10 mL of BBS, blocked with 10 mL of 1 M 
glycine, and washed with 10 mL 0.1 M glycine adjusted to pH 2.5 with HCl and re- 

15 neutralized fa BBS. ITie column was washed with enough volume for the optical 
density at 260X to reach baselme. 

AfBnity Purification of the Antibody 

The peptide afiBnity column was attached to a UV monitor and chart reooitler. 

20 

The titered rabbit antiserum was thawed and pooled. The serum was diluted with one 
volume of BBS and allowed to flow through flie columns at 10 mL per mmute. The 
non-peptide immunoglobuluis and other protems were washed from the column with 
excess BBS until the optical density at 280 X reached baselme. The columns were 
25 disconnected and the affinity purified column was eluted using a stepwise pH gradient 
from pH 7.0 to pH 1.0. The elution was monitored at 280 nM, and fixictions 
containing antibody (pH 3.0 to pH 1.0) were collected directly mto excess 0.5 M 
BBS. Excess buffer (0.5 M BBS) in tiie collection tubes served to neutralize flie 
antibodies collected in tiie acidic fractions of the pH gradient 

30 

The entire procedure was repeated with "depleted" serum to ensure maximal recovery 
of antibodies. The eluted material was concentrated usmg a stirred cell apparatus and 
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a membrane with a molecular weight cutoff of 30 kD. The concentration of the final 
preparation was determined using an optical density reading at 280 nM. The 
concentration was determined using the following formula: mg/mL = OD28o/l*4. 

S Example? 

SDS-PAGE and Immunoblot Analysis of BSTP-Ras 

This example describes in detail one approach to investigating the expression 
pattern of BSTP-Ras. Example X describes similar experiments that have been 
10 performed. 

Extracts are made from a variety of different cell lines and subjected to SDS-* 
PAGE followed by immunoblotting according to the protocol below, using an afiSnity 
purified polyclonal antibody to BSTP-Ras prepared as described in Example 6. 



IS Materials 

• Acetic acid. Glacial (Cat. No. A38*-212, Fisher) 

• Aciylamide (Cat No. A-3553, Sigma) 

• Anti-Rabbit IgG(H&L) (Cat. No. 31460ZZ, Pierce) 

• Bis-acrylamide (Cat No. M-7279, Sigma) 

20 • Blotting paper (Cat No. 170-3960, Bio-Rad, Hercules, CA) 

• Bovine Serum Albumm (LP) (Cat No. 100-350, Boehringer Mannheim, 
Indianapolis, IN) 

• Brilliant Blue R-250 (Cat. No. BPlOl-25, Fisher) 

• Complete™ Mini (Cat No. 1836153, Boehringer Mannheim) 

25 • ECL Western Blotting Detection Reagents (Cat. No. RPN2106, Amersham 
Pharmacia Biotech, Piscataway, NJ) 

• Ethyl alcohol (AAPER Alcohol and Paper Chemical Co., Shelbyville, KY) 

• Gelplate Clean (Cat. No. 786-140RF, Geno Technology, Inc., St. Louis) 

• Gelatin (Cat. No. G-2500, Sigma) 
30 • Glycerol (Cat No. BP229-1, Fisher) 

• Glycine (Cat No. G-8898, Sigma) 

• Hybond ECL (Cat No. RPN303D, Amersham Pharmacia Biotech) 



96 



wo 02/08282 



PCT/USOl/23574 



• Lauryl Sulfate (SDS) (Cat. No. L-3771, Sigma) 

• Methanol (Cat. No. BPl 105-4, Fisher) 

• M-Per (Cat. No. 78501, Pierce, Rockford,IL) 

• Nalgene bottle top filters (Cat No. 09-740-62B, Fisher) 
5 • Nonfet dry milk (Kroger Co., Cincinnati, OH) 

• Ponceau-S (Cat. No. P-07170, Sigma) 

• Potassium phosphate (Cat No. P-0662, Sigma) 

• 2X SDS gel loading buffer (Cat. No. 750006, Research Genetics, Huntsville, AL) 

• Size markers (Cat No. M-3913, M-4038, M-3788, Sigma) 
10 • Sodium azide (Cat No. S227I-25, Fish) 

• Sodium chloride (Cat No. S271-3, Fisher) 

• Sodium phosphate. Dibasic, Anhydrous (Cat No. BP332-1, Fisher) 

• t-amyl alcohol (Cat No. A-16852, Sigma) 

• TEMED (Cat No. T-9281, Sigma) 

15 • Trizma® Base (Cat No. T-6066, Sigma) 

• Tween-20 (Cat. No. BP337-500, Fisher) 

Solutions 

• PBS - Phosphate Buffered Saline dissolved m distilled wat«- 
20 -136mMNaCl 

-2,7mMKCl 

-10.1mMNa2HPO4 

-1.8mMKH2P04 

• Acrylamide/Bis (30% T, 2.67% C) dissolved in distilled water 
25 -4. 1 M acrylamide 

-5L9mMN-N'- 

• 1 .5 M Tris-HCl (pH 8.8) dissolved in distilled water 

• 0.5 M Tris-HCl (pH 6.8) dissolved in distilled water 

• 10% SDS - dissolve 10 grams SDS in 100 mis distilled water 
30 • Running Buflfer 

-24.8 mM Tris base 
-191.9 mM glycine 
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-3.5 mM SDS 

• Towbin transfer buffer (pH 8.3) dissolved in distilled water 
-20% methanol 

-25 mM Tris 
5 -192 mM glycine 

• Equilibrating buffer for gel drying, mixed in distilled water 
-20% ethanol 

-10% glycerol 

• Gel staining solution dissolved in distilled water 
10 -0.3 mM Coomassie brilliant blue R-250 

-40% methanol 

-7% glacial acetic acid 

• Gel destaining solution mbced in distilled water 
-25% methanol 

1 S -7% glacial acetic acid 

• 10%Tween*20inPBS 

• 5%NonfetdrymilkinPBS 

• 0.2% BSA Blocking Buffer dissolved in PBS 
-0.2% BSA 

20 -0.1% gelatin 

-0.05%Tween®20 

• Wash Buffer 
-0.05%Tween*20 
-IX PBS 

. 25 Equipment 

• Microcentrifuge (Model 5415, Eppendorf) 

• Power Pak 200 (Cat No. 165-5052, Bio-Rad) 

• Power Pak 3000 (Cat. No. 165-5056, Bio-Rad) 

• Protean II xi Cell (Cat. No. 165-1813, Bio-Rad) 

30 • Recirculating chiller (Cat. No. CFT33D1 15V, Neslab Instruments, Inc., 
Portsmouth, NH) 

• 20- Well comb (Cat. No. 165-1 867, Bio-Rad) 
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• pH Meter Corning 240 (Corning Science Products, Coming Glasswares, Coming, 

• Air Cadet vacuum pump (Cat. No. P-07530-50, Cole-Palmer Instmments Co., 
Chicago, \L) 

5 • Tissue Tearor tissue homogenizer (Cat No. 985370-07, BioSpec Products Inc., 
Bartletsville, OK) 



Methods 

Sample Preparation 

10 Preferably a variety of cell lines known in the art are used for the experiment, 

including cell lines derived from breast tumors, cell lines derived from normal breast, 
tissue, cell lines derived from other cancer types, etc. A selection of appropriate 
cancer cell lines for investigation of the expression of BSTP-Ras is found in reference 
21. A selection of noncancer cell lines appropriate for investigation of the expression 

1 5 of BSTP-Ras is found in Perou, et al.. Molecular portraits of human breast tumours, 
Nature, 406(6797):747-52, 2000. Cell lines are maintained under standani growth 
conditions and in standard tissue culture media as appropriate for the particular cell 
line. Cells are collected accordmg to standard techniques (e.g., trypsinization in the 
case of adherent cells), and the resulting cell suspension is prepared as follows: 

20 -The cell suspension is pelleted by centrifugation at 3000 RPM for 10 mmutes, and 
the supernatant discarded. 

-The pellet is washed with 1ml PBS, centrifiiged at 10000 RPM for 10 minutes, and 
the supernatant discarded. 

-An appropriate volume of M-Per™ Reagent is added to the cell pellet and mixed 
25 gently for 10 minutes in an ice bath. The mixture is centrifuged at 13200 RPM for 15 
minutes, and the supernatant is saved. 

The protem concentration in the supernatant is measured according to standard 
techniques. 

All samples are mixed at 1:1 with gel loading bujffer and boiled for 5 minutes before 
30 loading. 
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. SDS PAGE 

Standard SDS-PAGE stacking and running gels are prepared and placed in an 
electrophoresis apparatus. After filling the upper and lower chambers with running 
buffers the samples (60 fig/Iane) are loaded. The inner core is placed in the lower 
5 chamber and the lid placed on top. The apparatus is connected to the power supply 
and recirculating system. The temperature setting is 10*C. The stacking gel is run at 
14mA per gel for 1 hour. The separatmg gel is run at 0.58mA per gel per hour for 16 
hours. 

10 Transfer to nitrocellulose 

After electrophoresis is complete, the gel is equilibrated in Towbin Buffer for 
15-30 minutes. The assembly for transfer is as follows: 
cathode 

pre-soaked blotting paper 
15 gel 

pre-wetted nitrocellulose 
pre-soaked blotting paper 
anode 

The transfer is performed at 20V for 25 minutes, then 25V for 20 minutes. After the 
20 transfer is complete, the gel is stained with Coomassie and the blot is stained with 
Ponceau-S. 

Western Blotting 
Primary and secondary antibodies 
25 All primary and secondary antibodies are diluted in 02% BSA blocking 

buffer. All incubation steps arc done with gentle mixing. 

Blots are blocked in 5% milk overnight at room temperature. The blots are rmsed with 
wash buffer before adduig the primary antibody and incubating for two hours at room 
temperature. 

30 One wash cycle is performed. One wash cycle consists of: 
Wash 5 min, rinse 
Wash 5 min, rinse 
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Wash 10 min, rinse 

«« %M^AA •/ AAtAAif A mow 

Wash 5 min, rinse 

The secondary antibody is added and incubated for one hour at room temperature. 
5 One wash cycle is then performed. 
Peptide Block 

As a control to demonstrate the specificity of the antibody, equal amounts 
(w/w) of peptide and antibody are added to 1/10 of the final volume of blocking 
buffer and incubated overnight at 4°C. The volume of blocking buffer is then brought 
10 up to the final volume, and the membrane is incubated for an additional two hours at 
room temperature. 

Developmg 

The blots are placed in a Ziploc® bag. Equal volumes of ECL western blotting 
15 detection reagents are mixed and distributed evenly over tiie blots. The blots are 
placed in an autoradiography cassette, covered with a piece of film, and exposed. 

Example 8 

Detection of BSTP-Ras in Breast Cancer Samples by Immunohistochemistry 

20 

A breast cancer tissue microarray consisting of tissue samples firom a large 
number of breast cancer biopsies is prepared essentially as described m Kononen, J., 
et al.. Tissue microarrays for high-throughput molecular profiling of tumor specbnens, 
Nature Medicine, 4(7), 844-847, 1998. Briefly, several hundred archival parafSn- 

25 embedded breast tumor samples were obtained fi-om the Pathology Department at 
Stanford University Medical Center. The samples were reviewed by a pathologist 
(qjplicant MVR) to ensure that they met pathological criteria for breast cancer Small 
tissue cores were r^oved fi-om the samples and embedded in a single pamfBn block 
to produce a tissue array. Immunohistochemistry is performed as described 

30 previously (Perou, C, et al., 1999; Bindl, J. and Wamke, R., Am J Clin Pathol, 85, 
490-493, 1986, and Natkunam, Y., et al.. Am. J. Path., 1 56(1), 2000, using an 
antibody to BSTP-Ras/Rerg generated as described in Example 6. 
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Example 9 

BST'RAS/RERG Expression in Primary Breast Tumors, Breast-Derived Cell Lines, 

and Human Tissues 

5 

Materials and Methods 
Microarray Data Analysis 

A new analytical method called Statistical Analysis of Microarrays (SAM; 
described in Tusher, V,, Tibshirani, R., and Chu, G., Proc Natl Acad Set USA 98(9), 
10 51 16-5121, 2001) was used to analyze microarray data from 1753 differentially 
expressed genes, obtained as described in Example 5. The 1753 genes and their 
relatiye expression levels are presented in Table 1. 

RNA Isolation and Northern Blot Analysis 

Human breast-derived cell Imes were obtained from the American Type Culture 
Collection (ATCC) and were maintained in either RPMI or oMEM with 10% fetal calf 
serum. To analyze Rerg expression, RNA was isolated by acid-phenol extraction as 
described (Chirgwin, J. M., Przybyla, A. E., MacDonald, R. J., and Rutter, W. J. (1979) 
Biochemistry 18(24), 5294-9). Total RNA (-25 ^ig) was resolved on a 1.3% 
agarose/formaldehyde gel, denatured, and transferred to Hybond-N (Amersham 
Pharmacia Biotech). [a'^P]dCTP labeled Rerg or GAPDH cDNA was prepared with 
Random Prime Kit (BMB), purified with Sephadex G50 'TsIICK" column (Pharmacia), 
and hybridized to the RNA samples using PerfectHyb (Sigma), all according to the 
mstructions of the respective manu&cturers. 

To further analyze Rerg mRNA expression, a similar process to that described 
above was used to probe a himian multiple tissue Northern blot (Clontech). In addition, 
Rerg expression was investigated by screening the dbest database. 

Results 

30 Data analysis using SAM resulted in the identification of 264 genes whose 

expression correlated with overall patient survival. Of particular interest were genes 
in which high expression was associated with favorable patient outcomes and the 
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longest relapse free survival times. A single cluster of genes, the "luminal 
epiiheiiai/'ER4-" cluster, described more fully in copending U.S. Patent Application 
"Reagents and Methods for Use in Managing Breast Cancer, filed July 26, 2001 and 
in U.S. Provisional Patent Application Sen No. 60/220,967, filed July 26, 2000, 
S contained most of the genes whose high expression was associated with long survival 
times. Hius data analysis using SAM confirmed the initially recognized correlation 
between longer survival times and high expression of genes in the luminal 
epithelial/ER+ cluster. Tumors that most highly expressed the lummal/epithelial set 
of genes were, on average, some of the least proliferative tumors. BST-RAS/RERG 

10 was included in both the "luminal epithelial/ER+'' gene cluster and the 264 gene 
survival-associated cluster. In addition to Rerg, these clusters mcluded the ER and 
also a known estrogen-regulated gene (LIV-1). The mRNA expression pattern of 
Rerg across the 78 breast tumors described in Example 2 was a statistically significant 
predictor of patient outcome in a univariate analysis (p=0.0037), however, as would 

IS be expected based on its expression pattern, it was not an independent predictor of 
outcome in a multivariate analysis due to its correlation with ER expression 
(p=0.0001). 

As shown in the upper panel of Figure 4 (lanes 3, 5, 6, and 7), Rerg mRNA 
was detected in all of the estrogen receptor positive breast derived cell lines (MCF- 

20 lOA, BT474, MCF-7, and T47D). As shown m lanes 4, 8, 9, and 10, I^rg mRNA 
was not detected in any of the ER negative cell lines examuied (SK-BR-3, MDA-MB- 
23 1, Hs578T, and BT-549). Rerg mRNA was also not detected in rat intestinal 
epithelial cells (RJDE-l, lane 2). The lower panel of Figure 5, probed wifli labeled 
G4PD/f cDNA, shows that equal amounts of RNA were loaded in each lane. 

25 i&rg mRNA was detected by Northern analysis in human heart, brain, 

placenta, lung, liver, skin, kidney, and pancreas, with tiie highest levels of expression 
in heart, kidney, and pancreas. A potentially ubiquitous pattern of expression was 
supported by Unigene/EST expression data (Hs.l27756 and Hs.21594), which 
hidicated iSasXRerg is found in brain, kidney, muscle, prostate, uterus, adrenal gland, 

30 aorta, eye, lung, placenta, stomach, and testis. 
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Microarray Analysis Demonstrates that Rerg is an Estrogen-responsive Gene 
Materials and Methods 

All cDNA microarrays were produced and hybridized as described in Example 
land in either Perou, C. M., et al. Nature 406, 747-752, 2000 or Perou, C. M., et al. 
5 Proc Natl Acad Sci USA 96(16), 9212-9217, 1999. For the MCF-7 estrogen add back 
experiments, the parental vector only MCF-7 control cell line (MCF-7-pCDNA3) 
generated as described in Example 11 was used and was first grown up to 50-70% 
confluency in RPMI + 10% Fetal Calf Serum (FSC) + pen/strep, at which point the cells 
were switched to phenol-red free RPMI medium that contained 10% dextran-charcoal 

10 stripped FCS (Hyclone, Logan UT) plus pen/step for 48 hr. Some of these estrogen 
deprived cells were then given 1 0"' M P-1 7-estradioi (Sigma) for 4 hr, 8 hr and 24 hr. For 
the tamoxifen treated samples, 70% confluent cultures of MCF-7 cells in RPMI + 10% 
FCS + pen/strep media containing phenol red, were treated with either 1 fmi or 10 }jm 
tamoxifen (Sigma) for 48 hr. 

15 mRNA was prepared from each sample using a FastTrack 2.0 mRNA Kit 

(Invitrogen) and compared on 23,000 clone, or 44,000 clone cDNA microarrays prepared 
as described in Example 1 and in Perou, C. M., etal, ProcNatlAcadSciUSA 96(16), 
9212-9217, 1999. In the case ofihe tamoxifen treated samples, the control untreated ceUs 
were labeled witii Cy3 and were compared to the tamoxifen treated samples that were 

20 labeled with Cy5; for the estrogen add back experiments, die 48 hr estrogen deprived 
samples were labeled with Cy3 and were compared to the estrogen treated samples (CyS). 

Results 

The location of Rerg within a gene expression cluster that includes the ER, the 
25 high expression of Rerg in ER-positive breast cell lines, and die presence of two potential 
ER binding sites within the promoter region suggested that Rerg might be an estrogen- 
responsive gene. To evaluate this possibility, cDNA microarray analysis was used to 
examine Rerg expression in response to ^-estradiol stimulation in the estrogen responsive 
MCF-7 cell line (see http://genome-www4.stanford.edu/MicroArray/SMD/ for all 
30 microarray primary data files). MCF-7 cells grown in estrogen-free media for 2 days 
were supplemented with P-estradiol (1 X 10"^M) for 4, 8, or 24 hr before harvesting 
mRNA and were compared to 48 hr estrogen deprived MCF-7 cultures. 
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As shown in Figure 5, Rerg was rapidly expressed in response to Prcstradiol 
treatment (i.9-foid stimulation after 4 hr and 3-fbld after 24 hr), suggesting that it is a 
direct target of activated ER. Tamoxifen, an ER antagonist in the breast, acts as an 
inhibitor of estrogen-induced responses including the regulation of ER-responsive genes, 
5 Thus, the effect of tamoxifen (1 X 10^ M) in the presence of estrogen was examined on 
global gene expression patterns. As expected for an ER-regulated gene, iZerg expression 
was completely repressed by tamoxifen and induced by estrogen, although not as . 
dramatically as other known estrogen regulated genes (see cathepsm D and pmtem kinase 
HI 1 for examples of how known ER-responsive genes behaved m these experiments) 
10 (Fig. 5). 

Example 1 1 

Subcellular Distribution of Rerg in Living Cells 

IS Materials and Methods 
Plasmids 

The Rerg coding sequence was amplified from I.M A.G£. EST clone 28777 
using primers that contain EcoRI-Ndel and BamHI restriction sites and was subcloned 
mto the correspondmg sites of the yeast expression vector pWHA, which contains a 

20 hemagglutanin (HA) epitope tag, to create pWHA-/terg. Next, HA-i&f;g was 
subcloned mto the EcoRI-BamHI sites of the human expression vector pCDNAS 
(Invitrogen) resultmg in pCDNA3-HA-iterg. For localization studies in live cells, 
Rerg in pKH3 (an expression plasmid used to generate fusions with the HAl epitope 
as described in MoL Cell Biol, 14, 7943, 1994) was used as a PCR template to 

25 generate open reading frame cassettes for subcloning into the enhanced GFP > 
expression vector pEGFP-C3 (Clontech). The cassettes contained 5' EcoRI and 3' 
BamHI restriction enzyme recognition sites and w^e directionally cloned 
downstream of and in fi^e with the green fluorescent protein (GFP) tag of the vector 
(generating plasmid pEGFP-i?erg) to express GFP-Rerg fusion protein. 

30 

Cell Culture for Transfections 
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NIH 3T3 mouse fibroblasts were maintained in Dulbecco's Modified Eagle 
Medium (DMEM) supplemented with 10% calf serum. MCF-7 human breast 
carcinoma cells used for Rerg overexpression studies were maintained in ocMEM 
supplemented with 10% FCS. MCF-7 cells overexpressing HA-tagged wild ^e 
5 Rerg (designated MCF-7-Rerg), or vector alone (designated MCF-7-pCDNA3), were 
generated by transfecting cells wilh either pCDNA3-HA-ite/;g or pCDNA3 
respectively. Stable MCF-7 transformants were selected for resistance to 0.5 mg/ml 
of G418 and individual clones were isolated using cloning discs. All transfections 
were performed using LipoFectamine Plus Reagent (GibcoBRL) according to the 
10 directions of the manufacturer. 

Localization of GFP-Rerg in Cultured Cells 

NIH 3T3, MCF-7, or MDA-MB-23 1 cells mamtained as described above wctc 
transiently transfected with 1 ng pEGFP or pEGYP-Rerg using the LipofectAMINE 

IS PLUS Reagent (GibcoBRL) in serum-free medium according to manu&cturer's 
protocol. Three hours post-transfection cells were then switdied to normal growth 
medium, and 20 hr post-transfection cells were viewed in a Zeiss Axiophot 
fluorescence microscope (63X Plan-APOCHROMAT objective) equipped with a 
cooled charge-coupled device (CCD) camera and digitized images were captured 

20 . using MetaMorph™ 4.1.4 digital imaging software (Universal Imaging Corp.). GFP 
expression was detected from 20 ^g of cell lysate by immunoblotting with anti-GFP 
monoclonal antibody (Clontech) followed by anti-mouse HRP-conjugated antibody 
and visualized by enhanced chemilummescence reagent (Pierce). 

25 Results 

To determine the subcellular localization of Rerg, a green fluorescent protein 
(GFP) tagged Rerg fusion protein was transiently expressed in the human breast 
tumor-derived cell lines MCF-7 and MDA-MB-23 1 and in NIH 3T3 mouse 
fibroblasts. As shown in Figure 7A, transient transfection with the GFP-Rerg vector 
30 (pEGFP'Rerg) led to the production of an intact GFP-Rerg fusion protein as 

determined by anti-GFP western blotting (lane 2 in each panel). Epifluorescence 
microscopy of live cells transiently expressing GFP-Rerg revealed a uniform 
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distribution throughout the cell, including signal localized in the nucleus (Figure 7B, 

Stably expressing HA-epitope tagged Rerg, and in a series of additional cell lines 
transiently expressing GFP-Rerg. No cells were found that exhibited a plasma 
5 membrane localization of Rerg, which is consistent with the lack of a COOH-terminal 
prenylation signal sequence. This distribution is similar to that seen in cells expressing 
GFP alone (Figure 7B, upper panels). 

The distribution of HA-epitope-tagged Rerg was also examined using 
biochemical fractionation and Western blot analysis and showed that HA-Rei:g was 
10 present predominantly in the cytosolic fraction of MCF-7 cell lysates, with a small 
amount in the nuclear fiction. 



Example 12 
Growtii Inhibition by Rerg Protein 

15 

This example demonstrates the ability of Rerg to inhibit both anchorage- 
dependent growth (i.e.» cell growth on the bottom tissue culture dishes in liquid 
medium) and anchorage-independent growth (i.e., cell growth in soft agar). 

20 Materials and Methods 

Western Blot Analysis of HA-Rerg 

Total cell lysates were prepared by first growing cells to 80% confluence on 
100 mm plates. Next, the cells were washed one time with IX PBS and incubated on 
ice with 1 ml of RIPA buffer (150 mM NaCl, 1% NP.40, 0.5% DOC, 0.1% SDS, and 

25 50 mM Tris, pH 7.5) for 30 min with occasional shaking. Cells were tiien spun for 10 
min at 10,000 X g, and the protem concentration of the lysate determined by the 
Bradford Assay (BioRad). 40 fxg of cell lysate was resolved on al2% SDS- 
polyacrylamide gel and HA-Rerg expression analyzed by Western blotting using the 
anti-HA.l 1 monoclonal antibody (BAbCo). 

30 Anchorage-dependent and anchorage-independent growth assays 

MCF-7 cells stably expressing empty vector pCDNA3 (control) or 
overexpressing HA-Rerg (clones 1C4, 1C2, or 2B6) were generated as described m 
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Example 11. Growth curves were generated for MCF-7 cells stably expressing HA- 
Rerg protein or vector controls as described in Clark, G. J., Cox, A. D., Graham, S. 
M., and Der, C. J., Methods Enzymol 255, 395-412, 1995. In brief, stable transfectants 
(MCF-7 cells stably expressing empty vector (control) or Rerg overexpressing clones 
5 1C4, 1C2, or 2B6 generated as described in Example 1 1) were seeded at 2x10* cells 
per well in tissue culture medium in 6 well plates, then trypsinized and counted from 
triplicate wells at 1, 2, 3, 4, 5, 6, and 7 days after seeding. 

Soft agar growth analyses were performed as described previously (Clark, G. 
J., Cox, A. D., Graham, S. M., and Der, C. J., Methods Emymol 255, 395-412, 1995). 
10 Briefly, 10" MCF-7 cells stably e3q)ressing empty vector pCDNA (control) or HA- 
Rerg overexpressing clones 1C4, 1C2, or 2B6, were seeded in duplicate into 03% 
bacto-agar (Difco) over a 0.6% agar bottom layer, wifli colonies quantitated after -20 
days. 

IS Results 

The level of expression of HA-Rerg in clonal cell lines generated by 
tranfecting MCF-7 cells with empty vector (pCDNA3) or HA-Rerg expression vector 
(pCDNA-HA-iterg) was evaluated by Western blot analysis. As shown in Figure 8A, 
Western blot analysis with anti-HA antibodies verified expression of HA-Rerg in 
20 three clonal Rerg overexpressing isolates, with the highest expression detected in 

clone 2B6, medium expression detected in clone 1C4, and lowest expression detected 
in clone 1C2. 

The consequences of constitutively high Rerg egression on the rate of cell 
proliferation was first assessed in cell cultures maintained on plastic. All three clonal 

25 cell lines expressing HA-Rerg showed reduced rates of cell proliferation when . 
compared to the vector-transfected control cell line (Figure 8B). A similar growth 
inhibition was observed with MCF-7 cells when Rerg rapresslon was transient, i.e., 
induced under the control of an inducible promoter. Second, the effect of Rerg 
overexpression on anchorage-independent growth potential of MCF-7 cells was 

30 determined. As shown in Figure 8C, MCF-7 cells expressing HA-Rerg showed greatly 
reduced colony formmg ejSiciency (50-80% reduction) in soft agar when compared to 
tfie control vector only MCF-7 cells. These results indicate that elevated Rerg 
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expression causes a growth-inhibitory effect, suggesting that loss of Rerg expression 
or inhibition of Rerg function or activity may faciiiiate tumor development and/or 
growth. 

5 Example 13 

Effect of Rerg Overexpression on Tumor Formation in Immunocompromised Mice 

Materials and Methods 

Tumorigenicitv in Immunocompromised Mice 

10 Tumorigenicity analyses were performed as described previously (Clark, G. J., 

Cox, A. D., Graham, S. M., and Der, C. J., Methods Emymol 255, 395-412, 1995), 
except that 0.72mg/60-day time-release 17-P estradiol pellets (Innovative Research of 
America) were implanted subcutaneously -24 hr prior to mjection of cells. MCF-7 
cells stably expressing empty vector (pCDNA) or clones overexpressing Rerg at 

15 medium or high levels (clones 1C4 and 2B6, respectively) were mjected 

subcutaneously into 6-8 week athymic nude (BALB/c nu/nu) female mice at 10* cells 
per site, at two sites per animal (dorsal flanks). Tumor formation was monitored at 
least twice a week and measured using calipers over the skin of the anunal from 9-70 
days post-injection. 

20 

Results 

As shown in Table 1, tumors grew at each site injected with empty vector 
control MCF-7 cells with a latency of approximately 52 days. Tumors were also found 
in mice injected with cells from clone 1C4 (low HA-Rei^-expressing clone) with a 

25 similar latency. In contrast, mice injected with cells from the high HA-Rerg- 

expressing 2B6 clone remained tumor-free for an additional mon& (remaining tumor- 
free through the end of the observation period). Thus elevated Rerg expression 
inhibited tumor formation in nude mice, suggesting that loss of Rerg expression or 
inhibition of Rerg &nction or activity may facilitate tumor development and/or 

30 growth. 
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Table L Suppression of MCF-7 cell tumor fonnation in nude mice by R&rg 
expression 



Day* 


Vector 


Rerg 1C4 


Rerg 2B6 


9 


0/4 


0/4 


0/4 


17 


0/4 


0/4 


0/4 


22 


0/4 


0/4 


0/4 


31 


0/4 


0/3 


0/4 


37 


0/4 


0/3 


0/4 


43 


0/4 


1/3 


0/4 


47 


0/4 , 


1/3 


0/4 


52 


2/4 


1/3 


0/4 


57 


2/4 


2/3 


0/4 


65 


2/4 


2/3 


0/4 



5 ♦ post-injection 
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1 CLAIMS 

2 

3 We claim: 

4 1 . A substantially puri j5ed polypeptide whose sequence comprises the polypeptide 

5 sequence of SEQ ID NO: L 
6 

7 la. A fragment of the polypeptide of claim 1, wherein the fragment is at least SO 

8 amino acids in length. 
9 

10 lb. A fragment of the polypeptide of claim 1, wherein the fragment is at least 100 

11 amino acids in length. 
12 

13 Ic. A fragment of the polypeptide of claim 1, wherein the fragment is at least 150 

14 amino acids in length. 
15 

16 Id. A variant of the polypeptide of claim 1, wherein the variant includes between 1 

17 and 10 amino acid substitutions, inclusive. 
18 

19 le. A variant of the polypeptide of claim 1, wherem the variant includes between 11 

20 and 25 amino acid substitutions, inclusive. 
21 

22 If A variant of the polypeptide of claim 1, wherein the variant includes between 26 

23 and 50 amino acid substitutions, inclusive. 
24 

25 Ig. A variant of the polypeptide of claim 1, wherem the variant includes an addition 

26 or substitution of between 1 and 10 amino acids, inclusive. 
27 

28 Ih. A variant of the polypeptide of claim 1, wherein the variant includes an addition 

29 or substitution of between 1 1 and 25 amino acids, inclusive. 
30 

31 1 i. A substantially purified polypeptide having significant similarity to a polypeptide 

32 whose sequence is set forth in SEQ ID NO: 1, wherein a polypeptide is considered 



116 



wo 02/08282 



PCT/USOl/23574 



1 significantly similar if, when tiie amino acid sequence of the polypeptide is compared 

2 with me amino acid sequence of the polypeptide of SEQ ID N0:1 using the BLAST 

3 algorithm and the BLOSUM substitution matrix with default parameters, the result is 

4 a % identity greater than 60 or a % positive greater than 70 encompassing at least 25% 

5 of the length of SEQ ID NO:l, or both. 
6 

7 Ij. A substantially purified polypeptide having significant similarity to a polypeptide 

8 whose sequence is set forth in SEQ ID NO: 1, wherein a polypeptide is considered 

9 significantly similar if, when the amino acid sequence of the polypeptide is compared 

10 with the amino acid sequence of the polypeptide of SEQ ID NO: 1 using the BLAST 

1 1 algorithm and the BLOSUM substitution matrix with default parameters, the result is 

12 a % identity greater than 60 or a % positive greater than 70 encompassing at least 50% 

13 of the length of SEQ ID NO: 1, or both. 
14 

15 Ijj. A substantially purified polypeptide having significant similarity to a polypeptide 

16 whose sequence is set forth in SEQ ID NO: 1, wherem a polypeptide is considered 

17 significantly similar i( when the amino acid sequence of the polypeptide is compared 

1 8 with the amino acid sequence of the polypeptide of SEQ ID NO: 1 using the BLAST 

19 algorithm and Ihe BLOSUM substitution matrix with de&ult parameters, the result is 

20 a % identity greater than 60 or a % positive greater than 70 encompassing at least 75% 

21 of the length of SEQ ID NO:l, or both. 
22 

23 

24 Ik. A substantially purified polypeptide having significant similarity to a polypeptide 

25 whose sequence is set forth in SEQ ID NO: 1, wherein a polypeptide is considered 

26 significantly similar i^ when the amino acid sequence of the polypeptide is compared 

27 with the amino acid sequence of the polypeptide of SEQ ID NO: 1 using the BLAST 

28 algorithm and the BLOSUM substitution matrix with default parameters, the result is 

29 a % identity greater than 70 or a % positive greater than 80 encompassing at least' 25% 

30 of the length of SEQ.ID N0:1, or both. 
31 
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1 11. A substantially purified polypeptide having significant similarity to a polypeptide 

2 whose sequence is set forth in SEQ ED NO: 1, wherein a polypeptide is considered 

3 significantly similar if, when the amino acid sequence of the polypeptide is compared 

4 with the amino acid sequence of the polypeptide of SEQ ID NO:l using the BLAST 

5 algorithm and the BLOSUM substitution matrix with default parameters, the result is 

6 a % identity greater than 70 or a % positive greater than 80 encompassing at least 50% 

7 ofthe length ofSEQ ID N0:1, or both. . 
8 

9 1 11. A substantially purified polypeptide having significant similarity to a polypeptide 

10 whose sequence is set forth in SEQ ID NO: 1, wherein a polypeptide is considered 

1 1 significantly sunilar if, when the amino acid sequence ofthe polypeptide is compared 

12 with the amino acid sequence of the polypeptide of SEQ ID NO:l using the BLAST 

1 3 algorithm and the BLOSUM substitution matrix with de&ult parameters, the result is 

14 a % identity greater than 70 or a % positive greater than 80 encompassing at least 75% 

15 ofthe length ofSEQ ID NOrl, or both. 
16 

17 Im. A substantially purified polypeptide having significant similarity to a polypeptide 

18 whose sequence is set forth in SEQ ID NO: 1, wherein a polypeptide is considered 

19 significantly similar if, when the amino acid sequence of the polypeptide is compared 

20 with the amino acid sequence of the polypeptide of SEQ ID NO: 1 using the BLAST 

21 algorithm and the BLOSUM substitution matrix witii default parameters, the result is 

22 a % identity greater than 80 or a % positive greater than 90 encompassing at least 25% 

23 oftiie length of SEQ ID NO:l, or both. 
24 

25 In. A substantially purified polypeptide having significant similarity to a polypeptide 

26 whose sequence is set forth in SEQ ID NO: 1, wherein a polypeptide is considered 

27 significantly similar if, when the amino acid sequence of the polypeptide is compared 

28 with the amino acid sequence of the polypeptide of SEQ ID N0:1 using the BLAST 

29 algorithm and the BLOSUM substitution matrix with deJ&ult parameters, the result is 

30 a % identity greater ttian 80 or a % positive greater than 90 encompassing at least 50% 

31 of tiie length of SEQ ID N0:1, or both. 
32 
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1 Inn. A substantially purified polypeptide having significant similarity to a 

2 polypeptide wiiose sequence is set forth in SEQ ID NO: 1, wherein a polypeptide Is 

3 considered significantly similar if, when the amino acid sequence of the polypeptide is 

4 compared with the ammo acid sequence of Ifae polypeptide of SEQ E) NO: 1 using the 

5 BLAST algorithm and the BLOSUM substitution matrix with de&ult parameters, the 

6 result is a % identity greater than 80 or a % positive greater than 90 encompassing at 

7 least 75% of the length of SEQ ID NO:l, or both. 
8 

9 2. A purified and isolated polynucleotide comprising the polynucleotide sequence of 

10 SEQ ID NO: 2. 
11 

12 3. A purified and isolated polynucleotide comprising the polynucleotide sequence of 

13 SEQ ID NO: 3. 
14 

15 4. A purified and isolated polynucleotide comprising the polynucleotide sequence of 

16 SEQ ID NO: 3. 

17 . 

18 S. A purified and isolated polynucleotide having a sequence that is complementary to 

19 the polynucleotide sequence of any of claims 2, 3, or 4. 
20 

21 6. An isolated and purified polynucleotide encodmg a polypeptide whose amino acid 

22 sequence comprises the amino acid sequence of SEQ ID NO: 1 . 
23 

24 6a. An isolated and purified polynucleotide encoding a polypeptide having significant 

25 similarity to the polypeptide having a sequence set forth in SEQ ID NO: 1 , wherein a 

26 polypeptide having significant similarity the polypeptide having a sequence set forth 

27 in SEQ ID N0:1 is defined as m claim li, Ij, Ik, 11, Im, or In. 
28 

29 7. An isolated and purified polynucleotide that hybridizes to the polynucleotide of 

30 SEQ ID NO: 2 under stringent conditions. 
31 
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1 7a. An isolated and purified polynucleotide that hybridizes to the polynucleotide of 

2 SEQE)N0:3 under stringent conditions. 
3 

4 7b. An isolated and purified polynucleotide that hybridizes to the polynucleotide of 

5 SEQ ID NO:4 under stringent conditions. 
6 

7 7c. An isolated and purified polynucleotide that hybridizes to the polynucleotide of 

8 SEQ ID N0:4 under stringent conditions. 
9 

10 7d. An isolated and purified polynucleotide that hybridizes to the polynucleotide of 

1 1 SEQ ID NO:S under stringent conditions. 
12 

13 7e. An isolated and purified polynucleotide that hybridizes to the polynucleotide of 

14 SEQ ID NO: 2 under moderately stringent conditions. 
15 

16 7£ An isolated and purified polynucleotide that hybridizes to the polynucleotide of 

1 7 SEQ ID NO:3 under moderately stringent conditions. 
18 

ft 

19 7g. An isolated and purified polynucleotide that hybridizes to the polynucleotide of 

20 SEQ ID NO:4 under moderately stringent conditions. 
21 

22 7h. An isolated and purified polynucleotide that hybridizes to the polynucleotide of 

23 SEQ ID N0:4 under moderately stringent conditions. 
24 

25 7i. An isolated and purified polynucleotide that hybridizes to the polynucleotide of 

26 SEQ ID NO:S under moderately stringent conditions. 
27 

28 7j. An isolated and purified polynucleotide that hybridizes to the polynucleotide of 

29 claim 6 under stringent conditions. 
30 

31 7k. An isolated and purified polynucleotide that hybridizes to the polynucleotide of 

32 claim 6a under stringent conditions. 
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1 

2 7i. An isolated and purified poiynucieotide that hybridizes to the polynucleotide of 

3 claim 6 under moderately stringent conditions. 
4 

5 7m. An isolated and purified polynucleotide that hybridizes to the polynucleotide of 

6 claim 6a under moderately stringent conditions. 
7 

8 1 0. An expression vector comprising the polynucleotide of claim 6. 
9 

10 10a. An expression vector comprising the polynucleotide of claim 6a. 
11 

12 1 1 . An expression vector comprising a polynucleotide that encodes the fragment of 

13 any of claims la, lb, or.lc. 
14 

15 12. An expression vector comprismg a polynucleotide that encodes the variant of any 

16 of claims Id, le. If, Ig, or Ih. 
17 

18 20. A host cell comprising the expression vector of claim 1 0. 

1 9 20a. A host cell comprising the expression vector of claim 1 Oa. 
20 

21 30. A method of producing a polypeptide comprising an amino acid sequence selected 

22 from the group consisting of SEQ ID NO: 1 and a polypeptide whose sequence has 

23 significant similarity to the amino acid sequence of SEQ ID NO: 1 , the method 

24 comprising the steps of: 

25 culturing the host cell of claim 20 under conditions wherein the polypeptide is 

26 ^pressed; and 

27 recovering the polypeptide fix)m the host cell culture. 
28 

29 50. A pharmaceutical composition comprising: 

30 the polypeptide of claim 1, or a polypeptide whose sequence has significant 

3 1 similarity to the amino acid sequence of SEQ ID NO: 1 ; and 

32 a pharmaceutically acceptable carrier. 
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1 

2 55. A pharmaceutical composition comprising: 

3 ■ the polynucleotide of claim 6; and 

4 . a pharmaceutically acceptable carrier. 
5 

6 60. A piirified antibody that specifically binds to the polypeptide of claim 1 . 
7 

8 65. A purified antibody that specifically binds to the polypeptide of SEQ ID NO: 1 . 
9 

10 65a. A purified antibody that specifically binds to a polypeptide having significant 

1 1 similarity to the polypeptide having a sequence set fordi in SEQ ID NO: 1, wherein a 

12 polypeptide having significant similarity the polypeptide having a sequence set forth 

13 in SEQ ID N0:1 is defined as in claim li, Ij, Ik, 11, Im, or In. 
14 

15 66. The antibody of claim 60 or claim 65, wherein the antibody is a polyclonal 

16 antibody. 

17 • 

1 8 67. The antibody of claim 60 or claim 65, wherein the antibody is a monoclonal 

19 antibody. 
20 

21 70. A pharmaceutical composition comprismg: 

22 the antibody of claim 60 or claim 65; and 

23 a pharmaceutically acceptable carrier. 
24 

25 80. A method for detecting a polynucleotide that encodes a polypeptide comprising an 

26 amino acid sequence set forth m SEQ ID N0:1 in a biological sample, the method 

27 comprising steps of: 

28 (a) hybridizing a nucleic acid complementary to the polynucleotide that 

29 encodes a polypeptide comprising an amino acid sequence set forth in SEQ ID NO: 1 , 

30 or that encodes a firagment or variant of the polypeptide, to at least one nucleic acid in 

3 1 the biological sample, thereby forming a hybridization complex; and 
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1 (b) detecting the hybridization complex, wherem the presence of the 

2 hybi idization complex indicates the presence of a poiynucieotide encoding the 

3 polypeptide in the biological sample. 
4 

5 8 L The method of claim 80, wherein the biological sample comprises breast cancer 

6 tissue or cells. 
7 

8 82. The method of claim 80, wherein the biological sample comprises normal breast 

9 tissue or cells. 
10 

11 85. A method for detectmg a polynucleotide that encodes a polypeptide comprising an 

12 amino acid sequence set forth in SEQ ID NO: 1 comprising steps of: 

1 3 (a) hybridizmg a nucleic acid that encodes a polypeptide comprising an amino 

14 acid sequence set forth in SEQ ID NO: 1 to at least one nucleic acid complementary to 

1 5 a nucleic acid in the biological sample, thereby forming a hybridization complex; and 

16 (b) detecting the hybridization complex, wherein the presence of the 

17 hybridization complex mdicates the presence of the polynucleotide in the biological 

18 sample. 
19 

20 86. The method of claim 85, wherem the biological sample comprises breast cancer 

21 tissue or cells. 
22 

23 87. The method of claim 85, wherein the biological sample comprises normal breast 

24 tissue or cells. 
25 

26 90. A method for detecting a polypeptide whose sequence comprises the amino acid 

27 sequence set fordi in SEQ JD NO: 1 in a biological sample comprising steps of: 

28 (a) contacting the biological sample with the antibody of claim 60 or claim 65; 

29 and 

30 (b) determining whether the antibody specifically binds to the sample, the 

3 1 bindmg being an indication that the sample contains the polypeptide. 
32 
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1 91. The method of claim 90, wherein the biological sample comprises breast cancer 

2 tissue or cells. 
3 

4 92. The method of claim 90, wherein the biological sample comprises a cell, tissue, 

5 blood, urine, serum, ascites, saliva, or another body fluid, secretion, or excretion. 
6 

7 93. The methodof claim 90, wherein the determming step comprises p^orming an 

8 enzyme-linked inununosorbent assay. 
9 

1 0 94. The method of claun 90, wherein the determining step comprises performing 

1 1 unmunobistochemistiy. 
12 

13 95. The method of claim 90, wherein the determining step comprises contacting an 

14 antibody array with the sample. 
15 

1 6 90a. A method for detecting a polypeptide having significant similarity to the 

1 7 polypeptide whose sequence is set forth in SEQ ID NO: 1 in a biological sample 

18 comprising steps of: 

19 (a) contacting the biological sample widi the antibody of claim 60 or claim 65; 

20 and 

21 (b) determming whether the antibody specifically bmds to the sample, the 

22 bmding being an indication that the sample contains the polypeptide. 
23 

24 91a. The method of clmm 90a, wherein the biological sample comprises breast cancer 

25 tissue or cells. 
26 

27 92a. The method of claun 90a, wherein the biological sample comprises a cell, tissue, 

28 blood, urine, serum, ascites, saliva, or another body fluid, secretion, or excretion. 
29 

30 93a. The method of claun 90a, wherein the determining step comprises performing an 

3 1 enzyme-linked immunosorbent assay. 

32 
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1 94a. The method of claun 90a, wherein the determining step comprises pfenning 

2 immunohistochemistiy. 
3 

4 95a. The method of claim 90a, wherein the determining step comprises contacting an 

5 antibody array with the sample. 
6 



7 100. A method for treating or preventing a disorder of cell proliferation, the method 

8 comprising administering to a subject in need of such treatment an efiective amount 

9 of the pharmaceutical composition of any of claims 50, 55, or 70. 
10 

11 1 50. A metiiod for classifying a disease comprising the steps of: 

12 (a) providing a sample from a subject; 

13 (b) detecting the presence of a polynucleotide that encodes a polypq)tide 

14 having the sequence of SEQ ID NO: 1 within the sample; and 

1 5 (c) assigning the disease to one of a set of predetermined categories based on 



1 6 detection of the polynucleotide. 
17 

18 151. The method of claim ISO, wherein the disease is cancer. 
19 

20 1 52. The method of claim 1 SO, wherein the disease is breast cancer. 
21 

22 153. The method of claim 150, wherein the detecting step comprises a nucleic acid 

23 amplification step. 
24 

25 154. The method of claim 150, wherein the sample comprises a cell, tissue, blood, 



26 urine, saliva, ascites, other body fluid, secretion, or excretion. 
27 

28 150a. A method for classifying a disease comprising the steps of: 

29 (a) providing a sample from a subject; 

30 (b) detecting the presence of a polynucleotide that encodes a polypeptide 

3 1 having significant similarity to a polypeptide whose sequence comprises the sequence 

32 of SEQ ID NO: 1 withm the sample; and 
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1 (c) assigning the disease to one of a set of predetermined categories based on 

2 detection of the polynucleotide. 
3 

4 ISla. The method of claim ISOa, wherein the disease is cancer. 
5 

6 lS2a. The method of claim ISOa, wherein the disease is breast cancer. 
7 

8 lS3a. The method of claim 150a, wherein the detecting step comprises a nucleic acid 

9 amplification step. 
10 

11 1 54a. The method of claim 1 SOa, wherein the sample comprises a cell, tissue, blood, 

12 urine, saliva, ascites, other body fluid, secretion, or excretion. 
13 

14 155. The method of claim 150, fiirther comprising the step of: 



1 5 providing diagnostic, prognostic, or predictive information based on the 

16 predetennined category assigned in the assigning step. 
17 

18 1 55a. The method of claim 1 50a, further comprising the step of: 

19 providing diagnostic, prognostic, or predictive information based on the 



20 predetermined categoiy assigned in the assigning step. 
21 

22 156. The method of claim 155 or claim 155a, wherein the disease is breast cancer. 
23 

24 1 60. A method for classifying a disease comprising the steps o£ 



25 (a) providing a sample from a subject; 

26 (b) detecting the presence of a polypeptide comprising the sequence of SEQ 

27 ID NO: 1 within the sample; and 

28 (c) assigning the disease.to one of a set of predetennined categories based on 

29 detection of the polynucleotide. 
30 

31 161. The method of claim 1 60, wherein the disease is cancer. 
32 
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1 162. The method of claim 160, wherein the disease is breast cancer. 

3 164. The method of claim 1 60, wherein the sample comprises a cell, tissue, blood, 

4 urine, saliva, ascites, other body fluid, secretion, or excretion. 
5 

6 1 65. The method of claim 1 60, fiirther comprising the step of providing diagnostic, 

7 prognostic, or predictive information based on the category assigned in the assigning 

8 step, 
9 

10 1 66. The method of claim 1 60, wherein the detecting step is performed using an 

1 1 antibody that specifically bmds to the polypeptide of SEQ ID NO: 1 . 
12 

13 166a. The method of claim 166, wherein the determining step comprises performing 

14 an enzyme-linked unmunosorbent assay. 
15 

16 166b. The method of claim 166, wherein the determining step comprises performing 

1 7 immunohistochemistiy. 
18 

19 166c. The method of claim 166, wherein the determining step comprises contacting an 

20 antibody array with the sample. 
21 

22 1 70. A method for classifying a disease comprising the steps of: 

23 (a) providing a sample from a subject; 

24 (b) detecting the presence of a polypeptide having significant similarity to a 

25 polypeptide comprising the sequence of SEQ ID NO: 1 within tfie sample; and 

26 (c) assignmg the disease to one of a set of predetermined categories based oh 

27 detection of the polynucleotide. 
28 

29 171. The method of claim 170, wherein the disease is cancer. 
30 

31 1 72. The method of claim 170, wherein the disease is breast cancer. 
32 
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1 174. The method of claim 170, wherein the sample comprises a cell, tissue, blood, 

2 urine, saliva, ascites, other body fluid, secretion, or excretion, 
3 

4. 175. The method of claim 170, further comprising the step of providing diagnostic, 

5 prognostic, or predictive information based on the category assigned in the assigning 

6 step. 
7 

8 176. The method of claim 170, wherein the detecting step is performed using an 

9 antibody that specifically binds to the polypeptide. 
10 

1 1 176a. The method of claim 1 76, wherem the determining step comprises performing 

12 an enzyme-linked immunosorbent assay. 
13 

14 176b. The method of claim 1 76, wherein the determining step comprises performing 

15 immunohistochemistry. 
16 

17 176c. The method of claim 176, wherein the determining step comprises contacting an 

1 8 antibody array with tiie sample. 
19 

20 1 80. A method for obtaining prognostic, diagnostic, or therapeutic information 

21 comprising steps of: 

22 (i) obtaining a sample containing cells or tissue from a subject; and 

23 (ii) detecting, within the sample, a mutation in a regulatory or coding region of 

24 a gene that encodes the BSTP-Ras/Rerg polypeptide of SEQ ID NO: 1 . 
25 

26 181. The method of claim 1 80, wherein the mutation is ui an estrogen receptor binding 

27 site. 
28 

29 1 82. The method of claun 1 80, wherem the mutation is in a site encoding a GTPase 

30 domain. 
31 

32 183. The method of claim 180, wherein the mutation is in an ER binding site. 
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1 

3 a polypeptide comprising the amino acid sequence of SEQ ID N0:1 comprising steps 

4 of: 

5 (i) contacting a polypeptide comprising the amino acid sequence of SEQ JD 

6 NO: 1, a polypeptide having significant similarity to the polypeptide whose amino acid 

7 sequence is set forth in SEQ ID NO: 1, or a fragment of any of these polypeptides 

8 with a test compound; 

9 (ii) measuring GTP-binding and/or GTPase activity of the polypeptide; and 
10 (iii) identifying the test compound as an activator of the polypeptide if the 



1 1 polypeptide or fragment exhibits enhanced GTP-binding activity and/or reduced 

12 GTPase activity in the presence of the test compound relative to the level of GTP-.* 

13 binding activity and/or GTPase activity in the absence of the test compound. 
14 

15 210. The mefliod of claim 200, wherein the polypeptide or fragment includes a GTP- 

16 binding domain. 
17 

1 8 220. A method of mhibitmg the growth of a cell comprising enhancing the level or 

19 activity of a polypeptide comprismg the amino acid sequence of SEQ ID NO: 1 or a 

20 polypeptide having significant similarity to the polypeptide of SEQ ID NO: 1 in the 

21 ceU. 
22 

23 225. A method of inhibiting the growth of a cell comprising treating the cell, or 

24 treating an organism contaming tiie cell, with a compound identified according to the 

25 method of claim 200. 
26 

27 226. The method of claim 220 or claim 225, wherem the cell is a tumor cell. 
28 

29 250. A method of treating or preventing a tumor comprising steps of: 

30 (i) providing an individual in need of treatment or prevention of a tumor; 

3 1 (ii) administering a compound that enhances the level or activity of a 

32 polypeptide whose sequence comprises the amino acid sequence of SEQ ID NO: 1 . 
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1 

2 300. A method of identifying a compound that reduces or inhibits the expression or 

3 activity of a polypeptide whose seqjtience comprises the amino acid comprising steps 

4 of steps of: 

5 (i) contacting a polypeptide comprising the amino acid sequence of SEQ ID 

6 NO: 1 , a polypeptide having significant similarity to the polypeptide whose amino acid 

7 sequence is set forth in SEQ ID NO: 1, or a fragment of any of these polypeptides 

8 with a test compound; 

9 (ii) measurmg GTP-binding and/or GTPase activity of the polypeptide or 

10 fragment; and 

1 1 (iii) identifying the test compound as an inhibitor of the polypeptide if the 

1 2 polypeptide or fragment exhibits decreased GTP-bindmg activity and/or increased 

1 3 GTPase activity in the presence of the test compound relative to the level of GTP- 

14 binding activity and/or GTPase activity in the absence of flie test compound. 
15 

16 305. The method of clakn 300, wherem the polypeptide or fragment comprises a 

17 GTP-bihdihg domain. 
18 

19 3 1 0. A method of uihibiting the growth of a cell comprising reducing the level or 

20 activity of a polypeptide comprising the amino acid sequence of SEQ ID NO: 1 ui the 

21 cell. 
22 

23 320. A method of increasing the rate of growth or division of a cell comprising 

24 treating the cell, or treating an organism containing the cell, with a compound 

25 identified according to the method of claim 300. 
26 

27 400. A diagnostic kit comprising: 

28 an antibody that specifically binds to the polypeptide of SEQ ID NO: 1 ; 

29 instructions for use of the antibody; 

30 and a control sample, wherein the antibody specifically binds to a polypeptide 

31 in the control sample. 
32 



130 



wo 02/08282 



PCT/USOl/23574 



1 60 1 , A method of classifying a tumor comprising the steps of: 

2 providing a tumor sample; 

3 detecting expression or activity of a gene encoding the polypeptide of SEQ ED 

4 NO: 1 in the sample; and 

5 classifying the tumor as belonging to a tumor subclass based on the results of 

6 the detecting step. 
7 

8 605. The method of claim 601, wherein Ac detecting step comprises detecting the 

9 polypeptide. 
10 

1 1 606. The method of claim 605, wherein the polypeptide is detected by performing 

12 immunohistochemical analysis on the sample using an antibody tiiat specifically binds 

13 to the polypeptide. 
14 

15 606a. The method of claim 605, wherein the polypeptide is detected by performing an 

16 ELISA assay using an antibody that specifically binds to the polypeptide. 
17 

18 606b. The method of claun 605, wherein the polypeptide is detected using an antibody 

19 array comprising an antibody that specifically binds to the polypeptide. 
20 

2 1 606c. The method of claim 605, wherein the detecting step comprises: 

• 22 detecting modification of a substrate by the polypeptide. 
23 

24 607. The method of claim 601, wherein classifying a tumor comprises: 

25 stratifying a subject having the tumor for a clinical trial. 
26 

27 608. The method of claim 607, wherein the tumor is a breast tumor. 
28 

29 609. The method of claim 60 1 , wherein the tumor is a breast tumor and the tumor 

30 subclass is a lumbal tumor subclass. 
31 

32 601a. The method of claim 601, further comprising: 
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1 providing diagnostic, prognostic, or predictive information based on the 

2 classifying step. 
3 

4 605a. The method of claim 605, fiirther comprising: 

5 providing diagnostic, prognostic, or predictive information based on the 



6 classifymg step. 
7 

8 606aa. The method of claim 605a, v^herein the polypeptide is detected by performing 

9 immunohistochemical analysis on the sample using an antibody that specifically binds 
10 to the polypeptide. 

11 

12 606ab. The method of claim 60Sa, wherein the polypeptide is detected by performing 

13 an ELISA assay using an antibody that specifically binds to the polypeptide. 
14 

15 606ac. The method of claim 605a, wherein the polypeptide is detected using an 

16 antibody array comprising an antibody that specifically binds to the polypeptide. 
17 

1 8 606ad. The method of claim 605a, wherein the detecting step comprises: 

1 9 detectmg modification of a substrate by the polypeptide. 
20 



21 609a. The method of claim 601a, wherein the tumor is a breast tumor and the tumor 

22 subclass is a luminal tumor subclass. 
23 

24 601g. The method of claim 601 , fiirther comprising: 

25 selecting a treatment based on the classifying step. 
26 

27 605g. The method of claim 605, fiirther comprising: 

28 selecting a treatment based on the classifymg step. 
29 

30 606ag. The method of clahn 605g, wherein the polypeptide is detected by performing 

31 immunohistochemical analysis on the sample using an antibody that specifically binds 

32 to the polypeptide. 
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1 

2 606bg. The method of claim 605g, wherem the polypeptide is detected by perfonning 

3 an ELISA assay using an antibody that specifically binds to the polypeptide. 
4 

5 606cg. The method of claim 605g, wherein the polypeptide is detected using an 

6 antibody array comprising an antibody that specifically binds to the polypeptide. 
7 

8 606dg. The method of claim 605g, wherein the detecting step comprises; 

9 detecting modification of a substrate by the polypeptide. 
10 

1 1 609g. Tlie method of claun 601g, wherein the tumor is a breast tumor and the tumor 

12 subclass is a luminal tumor subclass. 
13 

14 601m. A method of testing a subject comprismg the steps of: 

1 5 providing a sample isolated firom a subject; 

1 6 detecting e;q)res5ion or activity of a gene encoding flie polypeptide of SEQ ID 

17 NO: 1 in the sample; and 

1 8 providing diagnostic, prognostic, or predictive information based on the 

19 detecting step. 
20 

21 60Sm. The method of claim 601m, wherein the detecting step comprises detecting the 

22 polypeptide. 
23 

24 606m. The method of claim 60Sm, wherein the polypeptide is detected by performing 

25 immunohistochemical analysis on the sample using an antibody that specifically binds 

26 to the polypeptide, 
27 

28 606ma. The method of claim 605m, wherein the polypeptide is detected by 

29 performing an ELISA assay using an antibody that specifically binds to the 

30 polypeptide. 
31 
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1 606mb. The method of claim 605m, wherein the polypeptide is detected using an 

2 antibody array comprising an antibody that specifically binds to the polypeptide. 
3 

4 606mc. The method of claim 605m, wherein the detecting step comprises: 

5 detecting modification of a substrate by the polypeptide. 
6 

7 609m. The method of any of claim 601m, wherein the sample is selected fiom the 

8 group consisting of: 

9 a blood sample» a urine sample, a serum sample, an ascites sample, a saliva 



10 sample, a cell, and a portion of tissue. 
11 

12 610m. The method of claim 601m, wherein the sample is a tumor sample. 
13 

14 61 Im. The method of claim 610m, wherein the tumor sample is a breast tiimor 

15 sample. 
16 

17 601r. A method of testing a subject comprising the steps of: 



1 8 providing a sample isolated from a subject; 

1 9 detecting expression or activity of a gene encodmg the polypejptide of SEQ ID 

20 N0:1 in the sample; and 

21 stratifying the subject for a clinical trial based on the detecting step. 



22 

23 60Sr. The method of claim 601r, wherein the detecting step comprises detecting the 

24 polypeptide. 
25 

26 606r. The method of claim 605r, wherein the polypeptide is detected by performing 

27 immunohistochemical analysis on the sample using an antibody that specifically binds 

28 to the polypeptide. 
29 

30 606ra. The method of claim 605r, wherein the polypeptide is detected by performing 

31 an ELISA assay using an antibody that specifically binds to the polypeptide. 
32 
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1 606rb. The method of claim 605r, wherein the polypeptide is detected using an 

2 antibody array comprising an antibody that speciticaUy binds to the polypeptide. 
3 

4 606rc. The method of claun 605r, wherein the detecting step comprises: 

5 detecting modification of a substrate by the polypeptide. 
6 

7 609r. The method of claim 601r, wherein the sample is selected Scorn the group 

8 consisting of: 

9 a blood sample, a urine sample, a serum sample, an ascites sample, a saliva 
1 0 sample, a cell, and a portion of tissue. 

11 

12 610r. The method of claim 601r, wherein the sample is a tumor sample. 
13 

14 611r. The method of claim 610r, wherein the tumor sample is a breast tumor sample. 
15 

16 601q. A method of testing a subject comprismg the steps of: 



1 7 providing a sample isolated fix>m a subject; 

1 8 detecting expression or activity of a gene encoding the polypeptide of SEQ ID 

19 NO:l in the sample; and 

20 selecting a treatment based on the detecting step. 



21 

22 605q. The method of claim 601q, wherein the detecting step comprises detecting the 

23 polypeptide. 
24 

25 606q. The method of claun 605q, wherein the polypeptide is detected by performing 

26 immunohistochemical analysis on the sample using an antibody that specifically hinds 

27 to the polypeptide. 
28 

29 606qa. The method of claim 605q, wherein the polypeptide is detected by performing 

30 an ELISA assay using an antibody that specifically binds to the polypeptide. 
31 
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1 606qb. The method of claim 605q, wherein the polypeptide is detected using an 

2 antibody array comprising an antibody that specifically binds to the polypeptide. 
3 

4 606qc. The method of claim 60Sq, wherein the detecting step comprises: 

5 detecting modification of a substrate by the polypeptide. 
6 

7 609q. The method of claim 60Iq, wherein the sample is selected from the group 

8 consisting of: 

9 a blood sample, a urine sample, a serum sample, an ascites sample, a saliva 
1 0 sample, a cell, and a portion of tissue. 

11 

12 610q. The method of claim 601q, wherein the sample is a tumor sample. 
13 

14 61 Iq. The method of claim 610q, wherein the tumor sample is a breast tumor sample. 
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Figure 1 

The amino acid sequence of BSTP-RAS/RERG (SEQ ID NO: 1) is presented below. 

MAKSAEVKLAIFGjRAGVGKSALVVRFLTKRPIWEYDPTLESTYRHQATIDDE 

GHMRWGEGPVLVYDITDRGSPEEVLPLIOlIILDEIKKPKim^ 

APYECSACTGEGNITEIFYELCaiEVRRRRMVQGKTRRRSSTTHVKQM 



The consensus nucleotide sequence (SEQ ID N0:2) of an open reading frame that 
encodes BSTP-Ras/Rerg is presented below. 

ATGGCTAAAAGTGCGGAGGTCAAACTGGCAATATTTGGGAGAGCA.GGCGTGGGC:^ 

GAGATTTCTGACCyyU^CGGTTCATCTGGGAATATGATCCCACCCTCGAATCAACCTACCGAC^^ 

CCATCGATGATGAAGTTGTTTCCATCGAGATACTAGACACTGCTGGTOVGGAAGATACCA^ 

GGGCACTlTGCGATGGGGGGAAGGCTTTGTGCTGGTCTACGACATTACTGACCGAGGAAGTm 

GCTGCCACTTAAGAACATCCTAGATGAGATCAAAAAGCCCAAGAATGTGACTC^ 

AAGCTGACTTGGACCACTCCAGGCAGGTTAGCACAGATiGAAGGAGAGAAGCTGGCCACAG^ 

GCTTTTTACGAGTGCTCTGCCTGCACTGGAGAAGGGAACATCT^CAGAGATATTCTATGAATTGTGTCGAGA 

GGTGCGTCGCCGGAGGATGGTGCAGGGCAAGACGAGGCGACGG^GCTCCACCACGCATGTCAAGCAAGC^ 
TTAACAAGATGCTCACCAAAATCAGTAGT 



The nucleotide sequence of I.M.A.G.E. clone 503581 (SEQ ID NO: 3), which was used 
in the microarray experiments described herein and includes a portion that encodes 
BSTP-Ras/Rerg (SEQ ID NO: 1) is presented below. 



A ArSA ATTP/Trir'A/'VIA /^ri/^'r/-'/T.A/-l A /-i/-<'r/-i/-i/-i>»>^/-<^ - . . _. 

CGACGCAGCTCCACCACGCATGTCAAGCAAGCCATTAACAAGATGCTCACCAAAATCAGT^^ 

TTAGGCAGCCCAGCTGAGGTGGACCAACTAATTGGAAACACIU^ 

AAAAATAAAACAAAATATTGCATTCTTTGTTTGGATTCrGAGAAATGTCTOG 

mnXKSCCTCTAATAGGTTGGGAAGTTTTAGCGTGTTTTATXK^ 

TTCCirrCCTGCTTGAATAAGATACACTCTAATGGCATTTCAACATGTAAT^^ 

GAAATGACTGGmATGTTAAGCTATTmAGGCATClTCACXnTGCTTrAAG 



wo 02/08282 



2/10 



PCT/USOl/23574 



TmGCAAAGGCATTTAAAAATTCAATTTCTrGTCAGA^^ 

TAAGATAGCAGAAAATACAGTAAAAACACAGGAGAAGAAGCTGAGCTATTGGAACAGGG^ 

ATAGAAGGAACTCTAGTTTCTGTTTGAAGTGAGGATTITCT^ 

TTTCmAAAATTITATTTTGT^ 

AGAGAATTCATTTCAATTTGAGTrGGTrCTCTCAA^ 

CCTTCCTTGGCTCAAACAATATTTTCCCTAT^ 

TGCTGGGCXnTmTCATCAAGTCAG^^ 

CrGACTTTATTGAATGCTAGTAGAAGATGTAGAATTAGAGACATCT^ 

GCAGAAAAACCACAGTCCAAAAGACAAGCAAATTAAGAATGGAGCITAACCATGCCT^ 

GGGAAGTYTAGACTTTGAGCCAGGTACAGTAAGAAAAATTAGCC^ 

CACATGACITATTTTGATATTTTGGATACATTAACT^ 

GTGATTAAAGTTCATTACAGCTGAATAAATGTGTYTAAAACAGACl^ 

CAGTYTACAACTGATAAAACCTTATGATTC^^ 

ATTTGGGTTACITrATITrAGTAGAAAAATATATATO 

GCATAGAATATATGTTATAAAAATTTGTTAAGCCGTTTAGCGGG^ 

CATTTTGGGAACAACTTTGNAATGTWAO^ 

TAATTTOHKATAAAAAAAAAAAAAAAAAAAAAAAA^^^ 
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The consensus nucleotide sequence (SEQ ID N0:4) of LMAG.E. clones 503581 and 
28777 is presented below. 



ttcc aaga ttcacaatgatatggtgaatccaaagactggaaccaaaaagatttact^ 

ttagttttaacaacagtaaattgtctaccaacacccatcatggcta^ 

ctggcaatatttgggagagcaggcgtgggcaagtcagctcitgtagtgagam 

cggttcatctgggaatatgatcccaccctcgaatcaacctaccgacaccaagcaaccatcgat 

gatgaagttgtttccatggagatactagacactgctggtcaggaagataccattcagaggga 

ggggcacatgcgatggggggaaggctttgtgctggtctacgacattactgaccgaggaagt^ 

ttgaggaagtgctgccacttaagaacatcctagatgagatcaaaaagcccaagaatgtgact 

ctcatcttggttggaaacaaagctgacttggaccactccaggcaggttagcacagaagaagg 

agagaagctggccacagaattggcttgtgctrtttacgagtgct 

gaacatcacagagatattctatgaj^ttgtgtcga.gaggtgcgtggccgga.ggatggtgcagg 

gcaagacgaggcgacgcagcrccaccacgcatgtcaagcaagc^^^ 

aaaatcagtagttaggcagcccagctgaggtggaccaactaattggaaacactot 

tgtrcccctttcaaaaataaaacaaaatattgcattcm 

GCTTCCCATTGTTTCTGGCCTCTAATAGGTTGGGAAGTTTTAGCGTGT^ 
GCTAACAATTTCTITCCriTrCCTGCTTGAAT^ 

accagagattctgaaatgactggtttatgttaagctatttttaggcatot 
gtaggttgaagtttttgcaaaggcatttaaaaattcaam 
ttcttaaaagtctaagatagcagaaaatacagtaaaaacacaggagaagaagctg 
ggaacagggaaatagaaggaactctagtttctgtttgaagtgaggat^ 

TATCATCTAGGTTTTCTTTAAAATTTTAT^ 

TTTCACTATAACAGAGAATTTCATTTCAATTTGAGTTGGTTC^ 

ACACCCTAACIUrCCTT(XTTGGCTCAAACAATAT^ 

AAAATTCACATCCCTGCTGGGCCTTTTTTCATCAAGTCAGGGTGATATA>^^ 

CTTTTCACCAAACCCTGACTITATTGAATGCTAGTAGAAGATGTAGA^ 

TrGTTTATCACCTTAGCAGAAAAACCACAGTCCAAAAGACAAGCAAATTA^ 

AACCATGCCTCCATTGGGAAGTBTAGACmGAGCCAGGTACAGTAAGAAAAAI^^ 

ATTCATTAAGTTrGCCACATGACTTATTTTGATATm 

TCAGAAAAGAATGGGTGATTAAAGTTCATTACAGCTGAATAAATGTGTCTAAAACAGACT 

GTATTCTGAAAGTACAGTCTACAACTGATAAAACCTTATGATTCTm 

CTATATATATCAAGATTTGGGTACTTTATTTTAGTAGAAAAT^^ 

TATTTATAAATGCATAGATATATGTATAAAAATTTGTAAGCGTTAGCGGCATTAAT^^ 

TGCATTTGGACAACTTGATGTAACTGACTTTATTTTATGTGAC^^ 

CTCATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



wo 02/08282 



4/10 



PCT/USOl/23574 



Figure 2 

A full length cDNA (2298 bp) encoding the polypeptide of SEQ ID N0:1, i.e., BSTP- 
Ras/Rerg is presented below as SEQ ID N0:5. The open reading frame extends from 
nucleotides 341 to 940. 

AGCGGAGTGCCTCCGCCGCTAGTCGGAGCX3AGCGCGAGTGAGGAGACCCCX:GCCGGGCCACT 

GGCACTTGCnTCTGCGGCGAGTCCCACCCACGACCGCAGCCCAGCAACTCGCAAAC^^ 

GAAGCCTGGGCTGCGCAGTGTGGGAGGGCTTCGCGATCnTGGGGGACCCATrCCGA^ 

GAGGACCGTAGCTCTCCrGGCCTGGAGAGTGTGAACAGGATTGTGGACTC^ 

AATGATATGGTGAATCCAAAGACTGGAACCAAAAAGATTTACTCAGTGCTTTAGT^ 

CAGTAAATTGTCTACCAACACCCATCATGGCTAAAAGTGCGGAGGTCAAACTGGCAATAm 

GGAGAGCAGGCGTGGGCAAGTCAGCnxnTGTAGTGAGATTTCTGACCA^ 

AATATGATCCCACCCTCGAATCAACCTACCGACACCAAGCAACCATCGATGATGAAGTTGm 

CCATGGAGATACTAGACACTGCTGGTCAGGAAGATACCATTCAGAGGGAGGGGCACATGCGA 

TGGGGGGAAGGCTTTGTGCTGGTCTACGACATTACTGACCGAGGAAGTTTTGAGGAAGTC^ 

CCACHTAAGAACATCCTAGATGAGATCAAAAAGCCCAAGAATGTGACTCTCATOT 

AACAAAGCTGACITGGACCACTCCAGGCAGGTTAGCACAGAAGAAGGAGAGAAGCTGGCCA 

AGAATTGGCTTGTGCTTrTrACGAGTGCTCTGCCn^^ 

ATTCTATGAATTGTGTCGAGAGGTGCGTCGCCGGAGGATGGTGCAGGGCAAGACGAGGCGAC 

GCAGCrCCACCACGCATGTCAAGCAAGCCATTAACAAGATGCTCACCAAAATCAGTAGTTAG 

GCAGCCCAGCTGAGGTGGACCAACTAATTGGAAACACTCTTCCCCITCT 

AATAAAACAAAATATTGCATTCTITGTTTGGATTCTGAGAAATGTCT 

TGGCCTCTAATAGGTTGGGAAGTTTTAGCGTGTTTTATGCAAT^ 

CTTTCCTGCTTGAATAAGATACACTCTAATGGCATTTG^^ 

AATGACTGGTTTATGTTAAGCTATTTTTAGGCATCTTC^^ 

TTGCAAAGGCATTTAAAAATTCAATTTCITGTCAGATACfA 

AGATAGCAGAAAATACAGTAAAAACACAGGAGAAGAAGCTGAGCTATTGGAACAGGGAAAT 

AGAAGGAA CTCTA GTTTCTGTITGAAGTGAGGATTTTCTGA^ 

TCnTTAAAATTTTATTTTGTTCITCAGT^ 

AGAATTTCATTTCAATTTGAGTTGGTTCTCTCAATGAT(^^ 

CTTOCTrG GCTCAAA CAATATTTTCCCTATAACAAAGC^ 

TGCT GGGC CTmrrCATCAAGTCAGGGTGATATAAAAACATTGG^ 

CTGACTTTATTGAATGCTAGTAGAAGATGTAGAATTAGAGACATCTGATTTGT^ 

GCAGAAAAACCACAGTCCAAAAGACAAGCAAATTAAGAATGGAGCTTAACCATGCCTCCAT^ 

GGGAAGTCrAGACnTTGAGCCAGGTACAGTAAGAAAAATTAGCCTCTGA^^ 

CACATGAOTATmGATATTTTGGATACATTAACTCAOT 

GTGATTAAAGTTCATTACAGCTGAATAAATGTGTCTAAAACAGACrCTTGTATrC^^ 

CAGTCTACAACTGATAAAACCTTATGATTCTTT^ 

ATTTGGGTACTITATTTTAGTAGAAAATATATATCTm 

TAGATATATGTATAAAAATTTGTAAGCGTTAGCGGCATTAATTCACCAATGCATT^ 

TT^GATGTAACTGACTTTATTTTATGTGACTATAATAAAAAG^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Figure 6 
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Figure 8 
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